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ABSTRACT 


/ 


This  paper  reviews  the  state  of  the  art  In  entmeratlve  solution  methods 
for  tL.  traveling  salesman  problem  (TSP).  The  Introduction  (Section  1) 
discusses  the  main  ingredients  of  branch  and  botmd  methods  for  the  TSP. 
Sections  2,  3  and  4  discuss  classes  of  methods  based  on  three  different  re¬ 
laxations  of  the  TSP;  the  assignment  problem  with  the  TSP  cost  function,  the 
1-tree  problem  with  a  Lagrangean  objective  function,  and  the  assignment 
problem  with  a  Lagrangean  objective  function.  Section  5  briefly  reviews  some 


other  relaxations  of  the  TSP,  while  Section  6  discusses  the  performance  of 
some  state  of  the  art  computer  codes.  Besides  material  from  the  literature, 
the  paper  also  Includes  the  results  and  statistical  analysis  of  some  computa¬ 


tional  experiments  designed  for  the  purposes  of  this  review. _ _ 
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1.  Introduction 


Since  the  first  attempt  to  solve  traveling  salesman  problems  by  an  enumer- 
ative  approach,  apparently  due  to  Eastman  [1938],  many  such  procedures  have  been 
proposed.  In  a  sense  the  TSP  has  served  as  a  testing  ground  for  the  development 
of  solution  methods  for  discrete  optimization,  in  that  many  procedures  ank,  devices 
were  first  developed  for  the  TSP  and  then,  after  successful  testing,  extended  to 
more  general  integer  programs.  The  term  "branch  and  bound"  itself  was  coined  by 
Little,  Murty,  Sweeney  and  Karel  [1963]  in  conjunction  with  their  TSP  algorithm. 

Enumerative  (branch  and  bound,  implicit  enumeration)  methods  solve  a  dis¬ 
crete  optimization  problem  by  breaking  up  its  feasible  set  into  successively 
smaller  subsets,  calculating  bounds  on  the  objective  function  value  over  each 
subset,  and  using  them  to  discard  certain  subsets  from  further  consideration. 

The  bounds  are  obtained  by  replacing  the  problem  over  a  given  subset  with  an  easier 
(relaxed)  problem,  such  that  the  solution  value  of  the  latter  bounds  that  of  the 
former.  The  procedure  ends  when  each  subset  has  either  produced  a  feasible 
solution,  or  was  shown  to  contain  no  better  solution  than  the  one  already  in 
hand.  The  best  solution  found  during  the  procedure  is  a  global  optimum. 

For  any  problem  P,  we  denote  by  v(P)  the  value  of  (an  optimal  solution 
to)  P.  The  essential  ingredients  of  any  branch  and  bound  procedure  for  a  dis¬ 
crete  optimization  problem  P  of  the  form  miarf(x)|x  €  S}  are 

(i)  a  relaxation  of  P,  i.e.  a  problem  R  of  the  form  min''g(x)lx  i  T], 
such  that  SCT  and  for  every  x,y€S,  f(x)  <  £(7)  implies  g(x)  <  g(y). 

(ii)  a  branching  or  separation  rule,  i.e.  a  rule  for  breaking  up  the 
feasible  set  of  the  current  subproblem  P^  into  subsets 

q 
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(iii)  a  lower  bounding  procedure,  i.e,  a  procedure  for  finding  (or 

approximating  from  below)  v(Rj^)  for  the  relaxation  of  each 
subproblem  ;  and 

(iv)  a  subproblem  selection  rule,  i.e.  a  rule  for  choosing  the  next 
subproblem  to  be  processed. 

Additional  ingredients,  not  always  present  but  always  useful  when  present, 
are 

(v)  an  upper  bounding  procedure,  i.e.  a  heuristic  for  finding  feasible 
solutions  to  P;  and 

(vi)  a  testing  procedure,  i.e.,  a  procedure  for  using  the  logical  implications 
of  the  constraints  and  bounds  to  fix  the  values  of  some  variables 
(reduction,  variable  fixing)  or  to  discard  an  entire  subproblem 
(dominance  tests). 

For  more  information  on  enumeratlve  methods  in  Integer  programming  see, 

for  instance,  Chapter  4  of  Garflnkel  and  Nemhauser  [1972] ,  and/or  the  surveys 
by  Bales  [1975],  Bales  and  Gulgnard  [1979],  Beale  [1979],  Speilberg  [1979]. 

Since  by  far  the  most  important  ingredient  is  (i),  we  will  classify  the 
branch  and  bound  procedures  for  the  TSP  according  to  the  relaxation  that  they 
use. 

The  integer  programming  formulation  of  the  TSP  that  we  will  refer  to  when 
discussing  the  various  solution  methods  is  defined  on  a  complete  directed  graph 
G  ■  (V,A)  on  n  nodes,  with  node  set  V  -  {l,...,n},  arc  set  A  *  {(i,j)(i,j  -  1,... 
and  nonnegative  costs  c^^  associated  with  the  arcs.  The  fact  that  G  is 
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complete  Involves  no  restriction,  since  arcs  that  one  wishes  to  ignore  can  be 
assigned  the  cost  c^^^  ■  *.  In  all  cases  c^^^  *  *,  V  i^V.  The  TSP  can  be 
formulated,  following  Dantzig,  Fulkerson  and  Johnson  [1954] ,  as  tha  problem 


(1)  min  Z  Sc  x 

l€Vj€V 

s.t. 


(2) 

2  X  -  1  ,  l€V 

J€V 

r  X.  .  -  1  ,  j€V 

IcV 

(3) 

“  “  ‘'-I  <  ^  1  "1  , 

t€Sj€3 

V  S  V,  S« 

(4) 

x^j  »  0  or  1,  i,j€V, 

where  x^^^  -  I  If  «rc  (l,j)  is  in  the  solution,  ■  0  otherwise. 


The  subtour  elimination  inequalities  (3)  can  also  be  written  as 


(5)  Z  Z  X  >  1  ,  V  S  c  V,  3,40 

i€Sj€\'\S  J  ^ 


A  very  important  special  case  is  the  symmetric  TSP,  in  which  c^j  "  ‘^jt’ 


Vi,j  . 

The  syrnaetrlc  TSP  can  be  defined 

on  a  con^Iete  undirected  graph  G  ■(V,E) 

on  n 

nodes,  with  node  set  V,  edge  set 

E,  and  arbitrary  costs  c^j.  It  can 

be  stated  as 

(6) 

s.t. 

min  2  Z  c  X 
ic7j>i 

(7) 

2  X  +  2  X  -  2  , 
j<i  ^  j>i 

i€v 
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(8)  S  S  i:  .  <  ]sl-l,  V  S  C  V, 

i€Sj€S  ^ 

i>i 

(9)  x^j  -  0  or  1,  i»j€V  ,  j  >i 

where  the  subtour  elimination  Ineqtialities  (8)  can  also  be  written  as 

(10)  2  2  X.,  +  2  2  X  >  2  ,  V  S  c  V.  S«l 

i€Sj€v\S  ^  i€V\Sj6S  ^ 

j>i  j>i 

Next  we  outline  two  versions  of  a  branch  and  bound  procedure  for  the  ISP. 
Prior  to  using  any  of  these  versions,  a  relaxation  R  of  the  TSP  must  be 
chosen.  Both  versions  carry  at  all  times  a  list  of  active  subproblems.  They 
differ  in  that  version  1  solves  a  (relaxed)  subproblem  only  when  node  k  is 
selected  and  taken  off  the  list,  while  version  2  solves  each  (relaxed)  sub¬ 
problem  as  soon  as  it  is  created,  l.e.  before  it  is  placed  on  the  list. 

Although  the  branch  and  bound  procedures  used  in  practice  differ  among  them¬ 
selves  In  many  details,  nevertheless  all  of  them  can  be  viewed  as  variants  of 
one  of  these  two  versions. 

Branch  and  bound  method  for  the  TSP 
Version  1 

1.  (initialization).  Put  TSP  on  the  list  (of  active  subproblems).  Initia¬ 
lize  the  upper  bound  at  U  ■  and  go  to  2. 

2.  (Subproblem  selection).  If  the  list  is  estpty,  stop:  the  tour  associated 

with  U  is  optimal  (or,  if  U  »  •,  TSP  has  no  solution).  Otherwise  choose 
a  subproblem  TSP^^  according  to  the  subproblem  selection  rule,  remove  TSpj 
from  the  list,  and  go  to  3. 
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3.  (Lower  botinding).  Solve  the  relaxation  of  TSP^  or  bound  v(R^)  from 

below,  and  let  be  the  value  obtained. 

If  >  U,  return  to  2. 

If  <  U  and  the  solution  defines  a  tour  for  TSP,  store  It  In  place 

of  the  previous  best  tour,  set  U  •-  L^,  and  go  to  5. 

If  <  U  and  the  solution  does  not  define  a  tour,  go  to  4. 

4.  (Upper  bounding:  optional).  Use  a  heuristic  to  find  a  tour  In  TSP.  If 
a  better  tour  Is  found  chan  the  current  best,  store  It  In  place  of  the 
latter  and  update  U.  Go  to  5. 

5.  (Reduction:  optional).  Remove  from  the  graph  of  TSP^  all  the  arcs  whose 
Inclusion  In  a  tour  would  raise  Its  value  above  U,  and  go  to  6. 

0.  (Branching).  Apply  the  branching  rule  to  TSP^,  l.e.  generate  new 
subprobletos  TSP^^j^,...,  place  them  on  the  list,  and  go  to  2. 

Version  2 

1.  (Initialization).  Like  In  version  1,  but  solve  R  before  putting  TSP 
on  the  list. 

2.  (Subproblem  selection).  Same  as  In  version  1. 

3.  (Upper  bounding:  optional).  Same  as  step  4  of  version  1,  with  "go  to  5" 
replaced  by  "go  to  4." 

4.  (Reduction:  optional).  Same  as  step  5  of  version  1,  with  "go  to  6"  replaced 
by  "to  go  5." 

5.  (Branching).  Use  the  branching  rule  to  define  the  set  of  subproblems 
TSPj^j^,.,.,TSP^q  to  be  generated  from  the  current  subproblem  TSP^, 
and  go  to  6. 
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».  (Lower  bovmding).  If  all  the  aubproblens  to  be  generated  from  TSP^ 
according  to  the  branching  rule  have  already  been  generated,  go  to  2. 
Otherwise  generate  the  next  aubproblem  TSP^j  defined  by  the  branching 
rule,  solve  the  relaxation  of  TSP^^  or  bound  v(R^j)  from  below, 

and  let  L^j  be  the  value  obtained. 

If  L^j  >  U,  return  to  6. 

If  <  U  and  the  solution  defines  a  tour  for  TSP,  store  it  in  place 
of  the  previous  beat  tour,  set  U  •-  and  go  to  6. 

If  <  U  and  the  solution  does  not  define  a  tour,  place  TSP^^  on  the  list 

and  return  to  6. 

In  both  versions,  the  procedure  can  be  represented  by  a  rooted  tree  (search 
or  branch  and  bound  tree)  whose  nodes  correspond  to  the  subproblems  generated, 
with  the  root  node  corresponding  to  the  original  problem,  and  the  successor  nodes 
of  a  given  node  i  associated  with  TSP^  corresponding  to  the  subproblems 
TSP^j^, . . .  ,TSPj^q  defined  by  the  branching  rule. 

It  is  easy  to  see  that  under  very  mild  assumptions  on  the  branching  rule 
and  the  relaxation  used,  both  versions  of  the  above  procedure  are  finite  (see 
Exercise  1). 

Next  we  discuss  various  specializations  of  the  procedure  outlined  above, 
classified  according  to  the  relaxation  that  they  use.  When  assessing  and 
comparing  the  various  relaxations,  one  should  keep  in  mind  that  a  "good"  re¬ 
laxation  is  one  that  (i)  gives  a  strong  lower  bound,  i.e.  yields  a  small 
difference  v(TSP)  -  v(R)  ;  and  (ii)  is  easy  to  solve.  Naturally,  these  are 
often  conflicting  goals,  and  in  such  cases  one  has  to  weigh  the  tradeoffs. 
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2.  Relaxation  I;  The  Assignment  Problem  with  the  TSP  Co3t  Ftr-ictlcn 

The  most  straightforward  relaxation  of  the  TSP,  and  historically  the  first 
one  to  have  been  used,  is  the  problem  obtained  from  the  integer  programnlng 
formulation  (1),  (2),  (3),  (4)  by  removing  the  constraints  (3),  i.e.  the 
assignment  problem  (AP)  with  the  same  cost  function  as  TSP.  It  was  used, 
amoog  others,  by  Eastman  [1958],  Little,  Murty,  Sweeney  and  Karel  [1963], 

Shapiro  [1966],  Bellmore  and  Malone  [1971],  Smith,  Srinivasan  and  Thompson 
[1977],  Carpaneto  and  Toth  [1980]. 

An  assignment  (i.e.,  a  solution  to  AP)  is  a  union  of  directed  cycles, 
hence  either  a  tour,  or  a  collection  of  subtours.  There  are  nl  distinct 
assignments,  of  which  (n-1) I  are  tours.  Thus  on  the  average  one  in  every  n 
assignments  is  a  tour.  Furthermore,  in  the  current  context  only  those  assign¬ 
ments  are  of  Interest  that  contain  no  diagonal  elements  (i.e.,  satisfy  x^^  « 

0,  i«l,**<,n),  and  their  number  is  n!/e  rounded  to  the  nearest  integer, 
i.e.  ln!/e  +  1/2]  (see,  for  example.  Hall  [1967],  p.  10).  Thus  on  the  average  one 
in  every  n/e  "diagonal-free"  assignments  is  a  tour.  This  relatively  high  fre¬ 
quency  of  tours  among  assignments  suggests  that  v(AP)  is  likely  to  be  a  pretty 
strong  bound  on  v(TSP),  and  computational  experience  with  AP-based  solution 
methods  supports  such  a  view.  To  test  how  good  this  bound  actually  Is  for 
randomly  generated  problems,  we  performed  the  following  experiment.  We  gen¬ 
erated  400  problems  with  50  <  n  <  250  with  the  costs  independently  drawn  from  a 
uniform  distribution  of  the  integers  over  the  internals  [1,100]  and  [1,1000], 
and  solved  both  AP  and  TSP.  We  found  that  on  the  average  v(AP)  was  99.27.  of 
v(TSP).  Furthermore,  we  found  the  bound  to  improve  with  problem  size,  in  that 
'  for  the  problems  with  50  <  n  <  150  and  150  <  n  <  250  the  outcomes  were  98.87. 
and  99.5%,  respectively. 
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b  solved  by  the  Hungerlan  method  (Kuhn  [19551  ;  for  «  ®ore  recent 
.  ri9751  or  Uwler  [19761)  in  at  most  O(n^)  steps. 

see  Chriatofidet 

creacmen  ,  solved  at  every  node  of  the  search  tree  differ 

The  assigment  problems  to 

f  om  the  initial  assignment  problem  AP  in  «t  some  arcs  are  excluded  (forbidden) 

—  included  (force)  into  the  solution.  These  modifi- 
from,  while  other  arcs  are  m 

difficulty,  see  the  Hungarian  nwthod  provides  at 

cations  do  not  present  any 

.4  «  a  lower  bound  onv(AP),  t  process  of  solving  a  subproblem  can 
every  iteration  a  lows. 

^  fVi.  lower  bound  meetshe  upper  bound  U.  More  importantly, 

be  stopped  whenever  tne 

/-  branching  rs  below),  the  assignment  problem  AP, 

in  the  typical  case  (see  the  j 

,  1  f  search  treiffers  from  the  problem  AP,  solved 

to  be  solved  at  node  J  of  tne  i 

.  in  that  a  c*in  arc  belonging  to  the  optimal  solution 

St  the  patM'  '■ 

.  J  rbe  solution  of,  and  possibly  some  other  arcs  are 

of  AP  is  excluded  from  tne  *04. 

^  sane  position  sr  out)  with  respect  to  the  solution 

required  to  maintain  the  sen  i- 

h  t  they  have  with  respect  to  of  AP^,  Whenever  this  is  the  case, 

^  be  solved  by  the  Htian  method  starting  from  the  optimal 

the  problem  APj  can 

.  u  .,..«hlen  at  the  parent  nor  at  a  brother  node),  in  at  most  OCn'^) 
solution  of  the  prooiea 

•t  se  2  or  Bellmore  and  b  (  1971]  ) .  For  an  efficient  implementa- 
steps  (see  Exercise 

f  this  version  of  the  Hungariaiod,  which  uses  on  the  average  consider- 

steps,  see  Carpend  Toth  [1980].  The  primal  simplex 
ably  less  than  0(n  )  steps, 

f  the  assignment  problem  has  sen  used  in  a  parametric  version 
fficiently  this  sequence  of  :lated  assignment  problems  by 

smith.  Stlnlm^an  and  Thompsou  115771. 

u  A  tr/AP)  can  be  slimproved  by  addition  of  a  penalty. 

The  lower  bound  v(Ar; 

u  «i/.„iated  as  the  minimal  e  in  the  objective  function 
This  can  be  calcuiaceu 

..  -  simplex  pivotliminates  some  arc  from  the 

caused  either  by  a  first  v 

f4r-gt  iteration  of  Urian  method  that  accomplishes  the  same 

solution,  or  by  a  ii* 

Furthermore,  the  arc  to  be  inc  the  solution  by  the  pivot  can  be 
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restricted  to  a  cutset  defined  by  some  subtour  of  the  AP  solution.  Computa* 
tional  experience  Indicates,  however,  that  the  impact  of  such  a  penalty  tends 
to  decrease  with  problem  size  and  is  negligible  for  anything  but  small  problems. 
In  the  con^utational  experiment  involving  the  400  randomly  generated  problems 
that  we  ran,  the  addition  of  a  penalty  to  v(AP)  raised  the  value  of  the  lower 
bound  on  the  average  by  0.03%,  from  99.27.  to  99.237.  of  v(TSP). 


Brariching  rules 

Several  branching  rules  have  been  used  in  conjunction  with  the  AP  relaxa- 
ation  of  the  TSP.  In  assessing  the  advantages  and  disadvantages  of  these  rules 
one  should  keep  in  mind  chat  Cho  ultimate  goal  is  to  solve  the  TSP  by  solving 

as  few  subproblems  as  possible.  Thus  a  "good"  branching  rule  is  one  that 
'a)  generates  few  successors  of  a  node  of  the  search  tree,  and  (b)  generates 

strongly  constrained  subproblems,  i.e.  excludes  many  solutions  from  each 
subproblem.  Again,  these  criteria  are  usually  conflicting  and  the  merits  of 
Che  various  rules  depend  on  the  tradeoffs. 

We  will  discuss  the  various  branching  rules  in  terms  of  sets  of  arcs 
excluded  (E^)  from,  and  Included 

k.  In  terms  of  the  variables  x^^ ,  the  interpretation  of  these  sets  is  that 
subproblem  k  is  defined  by  the  conditions 


(Ij^)  into  Che  solution  of  subproblem 


(11> 


/ 

j  0,  (i,j)  € 
|l,  (i,j)6l^. 


in  addition  to  (1),  (2),  (3),  (4).  Thus  the  relaxation  of  subproblem  k  is 
given  by  (11)  in  addition  to  (1),  (2),  (4).  We  abbreviate  Branching  Rule 
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MOi.  (Little,  Murty.  Sweeney  and  Karel  [1963]).  Given  the  current  re¬ 
laxed  subproblem  AP,  and  its  reduced  costs  -  v^.  where  u, 

and  Vj  are  optimal  dual  variables,  for  every  arc  (i,j)  such  that  c  -  0 
define  the  penalty 

Pij  ■  {'ih  =  '‘6v\tj)}  + 
and  choose  (r,8)  6  a  such  that 

"r.  ■  {"IJ  =  -  »}  • 

Then  generate  two  successors  of  node  k,  nodes  k  +  1  and  k  +  2,  by 


defining 


h+1  “®k  ‘Jf •  ife+i  -  \ 


^+2  “  \  \  . 

This  rule  does  not  use  the  special  structure  of  the  TSP  (indeed,  it  applies 

to  any  integer  program),  and  ha.  the  disadvantage  that  it  leaves  the 

optimal  solution  to  AP  feasible  for  AP 

^  .  ..  h+2* 

tt.  folWng  rule,  .r,  bawd  on  dl. Junction,  derived  froa  the  ,d.tour 
elimination  inequalities  (3)  or  (5), 

BR_24  (Eastman  [1958],  Shapiro  [1966])  Let 

V  o  Let  X  be  the  optimal  solution  to 

the  current  relaxed  subproblem  AP  and  let  A  -  f  (i  i  )  M  u 

•rc  set  of  .  mlnln«  oerdlnellty  euhtour  of  x’'  im-olvlng  the  node  eet 
S  “  Clj,...,lj).  Conetrelnt  (3)  for  s  Implle.  the  Inequellty 

^  <  |sl  -  1, 

(i,j)€Ag  J  -  '  ' 

which  in  turn  implies  the  disjunction 


'44  ■  “  ''  •  •  •  V  x^  j  .  0  . 

t  1 
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Generate  t  successors  of  node  k,  defined  by 


^+r  *  ^ 


^t+1  -  H  • 

Now  is  clearly  infeasible  for  all  r  -  l....,t.  and  the  choice 

of  a  shortest  subtour  for  branching  keeps  the  number  of  successor  nodes  small. 
However,  the  disjunction  (12)  does  not  define  a  partition  of  the  feasible  set 
of  AP^,  and  thus  different  successors  of  APj^  may  have  solutions  in  common.  This 
shortcoming  is  remedied  by  the  next  rule,  which  differs  from  BR  2  only  in  that 
it  strengthens  the  disjunction  (12)  to  one  that  defines  a  -jartirion, 

BRJLl  (Murty  [1968],  Bellmore-Malone  [1971],  Smith,  Srinivasan  and 
Thompson  [1977]).  The  disjunction  (12)  can  be  strengthened  to 

(U)  .  0)  .  1.  .  0,  .....  „ 

and  accordingly  (13)  can  be  replaced  by 


^+r  ■  \ 


with  -  ij^  . 

A  slightly  different  version  of  BR  3  (as  well  as  of  BR  2)  is  to  replace 
the  edge  sec  A„  of  a  minimum-cardinality  subtour  with  that  of  a  subtour  with  a 

d 
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xininum  number  of  free  edges  (I'.e,  edges  of  E\E^UI^).  This  rule  is  used  in 
Carpaneto  and  Toth  [1980], 

BR  4.  (Bellmore  and  Malone  [1971]),  Let  and  S  be  as  before. 
Constraint  (5)  implies  the  disjunction 

(16)  (x.  ,  -  0.  J€S)  V(x  -  0,  j€S)  V...V(x^  .  «  0,  j€S). 

!•* 

Generate  t  successors  of  node  k,  defined  by 

(17)  j>r.l....,t 

W  -  Ik  J 

Like  in  the  case  of  BR  2,  Br  4  makes  x^  infeasible  for  all  successor 
problems  of  AP,^,  but  again  (16)  does  not  partition  the  feasible  set  of  AP,  , 

IV 

This  is  remedied  by  the  next  rule,  which  differs  from  BR  4  only  in  that  it 
defines  a  partition. 

BR  5,  (Garfinkei  [1973]),  The  disjunction  (16)  can  be  strengthened  to 

(18)  (X  -  0,  j€S)  V(x  -  0,  J€V\S;  x  -  0.  j€S)  V... 

I-*  l-J  ^2^ 

V(**j  j  “  J®7\S,r  *  l,,.,,t~li  X,  j  ■  0,  j€S) 

r** 

and  accordingly  (17)  can  be  replaced  by 

\+r  j€S}  UC(i^,J):  q-l,....r-l,  j€V's.S} 

(19) 

^k+r  “  ^k  , 


Th«  two  rules  BR  2  end  BR  4  (or  their  strengthened  variants,  BR  3  and 

BR  5),  based  on  the  subtour-climlnatlon  constraints  (3')  and  (5),  respectively, 

« 

generate  the  same  ntuober  of  successors  of  a  given  node  k.  However,  the  rule 
based  on  Inequality  (5)  generates  more  tightly  constrained  subproblems ,  l.e., 
excludes  a  greater  nuniber  of  assignments  from  the  feasible  set  of  each  successor 
problem,  than  the  rule  based  on  Inequality  (3^).  Indeed,  with  |s|  >  k,  we  have 

Theorem  1.  (Bellmcre  and  Malone  [1971]).  Each  Inequality  (3^)  eliminates 
l(n-k)l/e  +  1/2 j  diagonal-free  assignments,  whereas  each  Inequality  (5)  eliminates 
l(n-k)!/e  +  1/2]  •  lk!/e  +  1/2]  diagonal-free  assignments. 

Proof.  Each  Inequality  (3^)  eliminates  those  diagonal-free  assignments 
that  contain  the  subtour  with  arc  set  A^.  There  are  as  many  such  assignments 
as  there  are  diagonal-free  assignments  In  the  complete  graph  defined  on  node 
set  V\S,  and  the  number  of  these  Is  (n-k)!/e  rounded  to  the  nearest  Integer,  l.e., 
l(n-k)!/e  +  1/2]  (see  section  2). 

On  the  other  hand,  each  Inequality  (5)  eliminates  those  diagonal-free 
assignments  consisting  of  the  union  of  two  such  assignments,  one  In  the  complete 
graph  defined  on  S,  the  other  In  the  complete  graph  defined  on  V\S.  Since  the 
number  of  the  latter  Is  l(n-k)l/e  +  1/2]  and  that  of  the  former  Is  [kle  -t-  1/2], 
the  number  of  diagonal-free  assignments  eliminated  by  each  Inequality  (5)  Is  as 
stated  In  the  theorem. || 

Nevertheless,  both  Smith,  Srlnlvasan  and  Thompson  [1977]  and  Carpaneto  and 
Toth  [1980]  found  their  respective  Implementations  of  BR  3  more  efficient  than 
BR  4  or  BR  5,  both  In  terms  of  total  computing  time  and  number  of  nodes  generated. 
We  have  no  good  explanation  for  this. 
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Othtr  f«ature8 

The  subproblem  selection  rule  used  by  many  branch  and  bound  algorithms  is  the 
one  known  as  "depth  first"  or  LIFO  (last  in  first  out).  It  amounts  to  choosing 
one  of  the  nodes  generated  at  the  last  branching  step  (in  order,  for  instance, 
of  nondecreaaing  penalties,  like  in  Smith,  Srinivasan  and  Thompson 

[1977]);  and  when  no  more  such  nodes  exist,  backtracking  to  the  parent  node 
and  applying  the  same  rule  to  its  brother  nodes.  This  rule  has  the  advantage 

of  modest  storage  requirements  and  easy  bookkeeping.  Its  disadvantage  is  that 
possible  erroneous  decisions  (with  respect  to  arc  exclusion  or  inclusion)  made 
early  in  the  procedure  cannot  be  corrected  until  late  in  the  procedure. 

The  alternative  extreme  is  known  as  the  "breadth  first"  rule,  which 
amounts  to  always  choosing  the  node  with  the  best  lower  bound.  This  rule  has 
the  desirable  feature  of  keeping  the  size  of  the  search  tree  as  small  as  possible, 
(sea  Exercise  3),  but  on  the  other  hand  requires  considerable  storage  space.  In 
order  to  keep  simple  the  passage  from  one  subproblem  to  the  next  one,  this  rule 
must  be  embedded  in  a  procedure  patterned  after  version  2  of  the  outline  in  the 

introduction,  which  solves  each  assignment  problem  as  soon  as  the  corresponding  node 
is  generated,  and  places  on  the  list  only  those  subproblems  TSP^^  with  <  U. 

The  procedure  of  Carpaneto  and  Toth  [1980]  uses  this  rule,  and  it  chooses  the 
subproblems  to  be  processed  (successors  of  a  given  node)  in  the  order  defined 
by  the  arc  adjacencies  in  the  subtour  that  serves  as  a  basis  for  the  branching. 

As  mentioned  earlier,  the  high  frequency  of  tours  among  assignments  makes 
AP  a  relatively  strong  relaxation  of  TSP,  which  in  the  case  of  random  (asynmetrlc) 
costs  provides  an  excellent  lower  bound  on  v(TSP).  However,  in  the  case  of 
the  symmetric  TSP,  the  bound  given  by  the  optimal  AP  solution  is  substantially 
weaker.  An  experiment  that  we  ran  on  140  problems  with  40  <  n  <  100  and  with 
symmetric  costs  independently  drawn  from  a  uniform  distribution  of  the  integers 
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in  Che  interval  [1,  1000],  showed  v(AP)  to  be  on  the  average  82%  of  v(TSP), 
while  the  addition  of  a  penalty  raised  the  bound  to  85%.  The  explanation  of  the 
relative  weakness  of  this  bound  is  pretty  stralghcfortMrd:  in  the  syniaecric  case, 
there  is  a  tendency  towards  a  certain  S3nametry  also  in  the  solution,  to  Che  effect 
that  if  ■  1,  then  (since  c^j  ■  one  tends  to  have  ■  1  too; 

and  thus  the  optimal  AP  solution  usually  contains  a  lot  of  subtours  of  length  2, 
irrespective  of  the  size  of  n  .  Thus  as  a  rule,  a  much  larger  number  of 
subtours  has  to  be  eliminated  before  finding  an  optimal  tour  in  the  symmetric 
case  chan  in  the  asymmetric  one.  Itils  makes  the  AP  a  poor  relaxation  for 
Che  symmetric  TSP. 

3.  Relaxation  II !  The  1-Tree  Problem  with  Lagranaean  Objective  Function 

This  relaxation  was  successfully  used  for  the  symnetric  TSP  first  by  Held 
and  Karp  [1970,  1971]  and  Christofides  [1970],  and  subsequently  by  Helbig  Hansen 
and  Krarup  [1974],  Smith  and  Thompson  [1977],  Volgenant  and  Jonker  [1982], 

Consider  the  symmetric  TSP  and  the  undirected  (complete)  graph  G  ■  C>%£) 
associated  with  it.  The  problem  of  finding  a  connected  spanning  subgraph  H 
of  G  with  n  edges,  that  minimizes  the  cost  function  (6),  is  obviously  a 
relaxation  of  the  symmetric  TSP.  Such  a  subgraph  H  consists  of  a  spanning 
tree  of  G,  plus  an  extra  edge.  We  may  further  restrict  H  to  the  class  m 
of  subgraphs  of  the  above  c/pe  in  which  some  arbitrary  node  of  G,  say  node  1, 
has  degree  2  and  is  contained  in  the  unique  cycle  of  H.  For  lack  of  a 
better  term,  the  subgraphs  of  this  class  J"  are  called  l-trees.  To  see  that 
finding  a  l>tree  that  minimizes  (6)  is  a  relaxation  of  the  TSP,  it  suffices  to 
realize  that  the  constraint  set  defining  the  fat.iily  ^  is  (9)  and 
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(20)  z  r  X  +  r  r  x  >  i,  v  s  ^  v, 

i€Sj£V\S  i£V\Sj€S  ^  * 

j>l  J>i 

(21)  Z  “  -  a 

iS7j>i 

(22)  “*14-2 

j€V 

Here  (20)  is  a  weakening  of  (10),  (21)  is  the  sum  of  all  equations  (7) 
divided  by  two,  and  (22)  is  the  first  equation  (7). 

The  minimum-cost  1-tree  problem  is  easily  seen  to  be  decanposable  into 
two  independent  problems : 

(a)  Co  find  a  minimum-cost  spanning  tree  in  G  -  [l};  and 

(9)  to  find  two  smallest-cost  edges  among  those  incident  in  G  with  node  1. 

The  n-2  edges  of  the  spanning  tree  found  under  (»),  together  with  the 

2  edges  found  under  (9),  form  a  minimum-cost  1-tree  in  G. 

Solving  problem  (j!)  requires  0(n)  comparisons,  whereas  problem  (cr) 

can  be  efficiently  solved  by  the  algorithms  of  Dijkstra  [1959]  or  Prim  [1957], 

2 

of  complexity  0(n  ),  or  by  the  algorithm  of  Kruskal  [1956],  of  complexity 
0(|e[  log  Ie{).  Since  the  log  ]e]  in  the  last  expression  comes  from  sorting 
the  edges,  a  sequence  of  subproblems  that  requires  only  minor  resorting  of  the 
edges  between  two  members  of  the  sequence  can  be  more  efficiently  solved  by 
Kruskal 's  procedure  than  by  the  other  two. 

The  number  of  1-trees  in  the  complete  undirected  graph  G  on  n  nodes  can  be 
calculated  as  follows:  the  number  of  distinct  spanning  trees  in  G  -  {1}  is 
(n-1)  (Galley's  formula),  and  from  each  spanning  tree  one  can  get  (  2  ^  distinct 
l-trees  by  inserting  two  edges  Joining  node  1  to  the  tree.  Thus  the  number  of 
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1-trees  in  G  is  j(n-2)  (n-l)*^"^ ,  which  is  much  higher  than  the  number  of  solu¬ 
tions  to  AP.  Since  G  has  (n-1)!  tours,  on  the  average  the  number  of  tours  among 

T  n-3 

the  l-trees  of  a  complete  undirected  graph  is  one  in  every  j(n-2)(n-l)  /(n-2)!, 

and  hence  the  minimum-cost  1-tree  problem  with  the  same  objective  function  as 
the  TSF  is  a  rather  weak  relaxation  of  the  ISP.  In  the  above  mentioned  computa¬ 
tional  experiment  on  140  randomly  generated  symmetric  problems,  we  also  solved 
the  corresponding  1-tree  problems  and  found  the  value  of  an  optimal  1-tree  to  be 
on  the  average  only  637.  of  v(TSP) .  However,  this  relaxation  can  be  considerably 
strengthened  by  taking  the  equations  (7)  into  the  objective  function  in  a  Lagrangean 
fashion,  and  then  maxlmlrlng  the  Lagrangean  as  a  function  of  the  multipliers. 

The  problem 


(23) 


im4  “n 


•  min  2  S  (c  ,  +  X.  +X,)x. . 
x€X(J)i€Vj>l  ^  ^  J 


» 


where  X  is  any  n- vector  and  X(J0  is  the  set  of  incidence  vectors  of  1- trees  in 
G,  i.e.,  the  set  defined  by  (9),  (20),  (21),  (22),  is  a  Lagrangean  relaxation 
of  the  TSP,  From  the  last  expression  in  (23)  and  the  fact  that  X(^  contains 
all  tours,  it  is  easy  to  see  that  for  any  X,  L(X)  <  v(TSP).  (For  surveys  of 
Lagrangean  relaxation  in  a  more  general  context  see  Geoff rlon  [1974],  Fisher 
[1981],  Shapiro  (1979].)  The  strongest  Lagrangean  relaxation  is  obviously  given 
by  X  *  X  such  that 

(24)  L(X)  -  max  L(X)  . 


rccblem  (24)  is  sometimes  called  a  Lagrangean  dual  of  the  TSP. 


Now  (24)  is  a  much  stronger  relaxation  than  the  l*tree  problem 
with  the  TSP  cost  function.  Indeed,  ccmiputational  experience  with  randomly 
generated  problems  has  produced  on  the  average  values  of  L(X)  of  about  997. 
of  v(TSP)  according  to  Christofldes  [1979]  (p.  134),  and  of  about  99.77. 
of  v(TSP)  according  to  Volgenant  and  Jonker  [1982]. 

However,  solving  (24)  is  a  lot  more  difficult  than  solving  a  1-tree 
problem.  The  objective  function  of  (24),  l.e.  the  function  L(X)  of  (23),  is 
piecewise  linear  and  concave  in  X.  Thus  L(X)  is  not  everywhere  differentiable. 
Held  and  Karp  [1971],  who  first  used  (24)  as  a  relaxation  of  the  TSP,  have  tried 
several  methods,  and  found  that  an  iterative  procedure  akin  to  the  relaxation  method 
of  Agmon  [1954]  and  Motzkin  and  Schoenberg  [1954]  was  the  best  suited  approach 
for  this  type  of  problem.  The  method,  which  turned  out  to  have  been  theoret¬ 
ically  studied  in  the  Soviet  literature  (see  Polyak  [1967]  and  others) 
became  the  object  of  extensive  investigations  in  the  Western  literature  under 
the  name  of  subgradiant  optimization,  as  a  result  of  its  successful  use 
by  Held  and  Karp  in  conjunction  with  the  TSP  (for  surveys  of  subgradient  opti¬ 
mization  in  a  more  general  context  see  Held,  Wolfe  and  Crowder  [1974], 

Sandi  [1979]). 

The  subgradient  optimization  method  for  solving  (24)  starts  with  some 

arbitrary  X  ■  X°  (say  the  zero  vector)  and  at  iteration  k  updates  X  as 

It  Ic  Ic 

follows.  Find  L(X  ),  i.e.  solve  problem  (23)  for  X  -  X  .  Let  H(X  )  be 

k  k 

the  optimal  1-tree  found.  If  H(X  )  is  a  tour,  or  if  v(H(X  ))  >  U, 
stop.  Otherwise,  for  i€V,  let  d^  be  the  degree  of  node  i  in  H(X  ). 

tf  If 

Then  the  n-vector  with  components  d^  -  2,  i€V,  is  a  subgradlent  of  L(X)  at  X 


(see  Exercise  4).  Set 
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(25)  +  t^(d^  -  2).  tev 

v 

where  t  is  the  "step  length"  defined  by 

(26)  t^  -  a(u  -  L(xS)/  2  (d^  -  2)^ 

i€V  ^ 

xfith  0  <  cr  <  2.  Then  set  k  •-  k+I  and  repeat  the  procedure. 

It  can  be  shown  (see  any  of  the  surveys  mentioned  above)  that  the  method 

Oft 

tp 

converges  if  It  ■  «  and  lim  t  ■  0.  These  conditions  are  satisfied  if 
k*l  k-« 

one  starts  with  a  *  2  and  periodically  reduces  a  by  some  factor. 

Example  1. 

Consider  the  8-city  symmetric  TSP  whose  graph  is  shown  in  Fig.  1  (only  arcs 
with  finite  cost  are  present).  Initially  U  ■  25,  a  »  2,  ‘a.°  ■  0  for  i  ■  1,...,8. 


Fig.  1.  Initial  graph  G  ■  (V,E) 
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Th®  optimal  1-tree,  shown  in  heavy  lines  in  Fig.  1,  has  a  weight  of  L(X°)  *  21. 
At  iteration  0  we  have: 

d°  -  (2,  2,  4,  1,  1.  3,  2,  1); 

t°  -  2(25-21)/8  -  1; 

xj  •  (0,  0.  2,  -1,  -1,  1,  0,  -1). 

The  updated  arc  costs  (c^^  +  xj  +  xj)  and  the  corresponding  optimal  1-tree, 
having  a  weight  of  L(X^)  -  24,  are  shown  in  Fig.  2. 


1  P 

We  have  d^  »  2  for  i  •  1 . 8;  thus  a  tour  has  been  found  and  the  procedure  stops. || 


Held  and  Karp  [1971]  pointed  out  that  if  is  taken  to  be,  instead  of 
0,  the  vector  defined  by 


where  (u,  v)  is  an  optimal  solution  to  the  dual  of  the  assignment  problem  with 
costs  c^j  ■  Cjj^,  V  i,j,  then  one  always  has  v(H(X°))  >  v(AP).  Indeed,  for  this 
choice  of  one  has  from  (23) 

L(X°)  -  min  2  2  (c.,  +  X°  +  X°)x..  -  2  2  X° 

x6X(w^  i€V  J>i  ^  i6V  ^ 


min  2  2  rl  (c.  -  u  -  v.)  +  (c  -  u,  -  v. )]  +  2  (u  +  v.) 

x€x(0  i€V  j>i  ^  ^  J  Ji  J  i  i  1 


>  v(AP), 


since  v(AP)  «  2  (u  +  v. )  and  c.  -  u  -  v.  >  0,  V  i,j. 

1  1  ij  i  J  - 

This  kind  of  initialization  requires  of  course  that  one  solve  AP  prior  to 
addressing  problem  (24). 

Helbig  Hansen  and  Krarup  [1974]  and  Smith  and  Thompson  [1977]  distinguish  be 
tween  the  application  of  the  subgradient  procedure  at  the  root  node  of  the 

f 

search  tree  and  at  subsequent  nodes,  by  using  different  starting  vectors  \° 
and  different  stopping  rules. 

Volgenant  and  Jonker  [1982]  use  an  updating  formula  for  X  ,  and  an  ex- 

V 

presslon  for  t  ,  different  from  (25)  and  (26),  respectively.  Namely,  they 

It 

take  t  to  be  a  positive  scalar  decreasing  according  to  a  series  with  a 
constant  second  order  difference,  i.e. 


-  2t^  +  t^“^  -  constant. 
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and  define 


,k+l 


by  setting  for  i£V, 


(28) 


.k+1  -  < 


if  d^  -  2 


+  0,6t^(d^-2)  +  0.4t^(d^  ^-2)  otherwise 


It  should  be  mentioned  that  none  of  the  versions  of  this  subgradient  optimization 
method  can  be  guaranteed  to  solve  (24)  in  polynomial  time  with  a  prespecified 
degree  of  accuracy.  However,  the  stopping  rules  are  such  that  after  a  certain 
number  of  iterations  the  procedure  terminates  with  an  approximation  to  an 
optimal  X,  which  gives  a  (usually  good)  lower  bound  on  L(X). 


Branching  rules 

5R  6,  (Held  and  Karp  [19711),  At  node  k,  let  the  free  edges  of  the  current 
l«tree  (i.e.  those  in  EN^U 

and  let  the  first  q  elements  of  this  ordered  set  be  j  »  C(ij^»Jj^) . (iq»jq)}» 

where  q  will  be  specified  below.  Define  q  new  subproblems  by 

*k+t  ■  ^  =  '■  ■  ' . 1 . . 

“Tt+r  ■  \ ^ . 

\+q  •  \  ••  i  -  P  or  J  -  p} 

Here  p  €  V  is  such  that  contains  at  most  one  edge  incident  with 
p,  while  contains  two  such  edges;  and  q  is  the  smallest  subscript 

of  an  edge  in  J  for  which  a  node  with  the  properties  of  p  exists. 

This  rule  partitions  the  feasible  set,  and  makes  the  current  1-tree 
infeasible  for  each  of  the  new  subproblems  generated,  but  the  number  q  of 
the  new  subproblems  is  often  larger  than  necessary. 


L  )  be  ordered  according  to  nonincreasing  penalties, 
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3R  7,  (Smith  and  Thompson  [1977]),  Choose  a  node  whose  degree  in  the 
current  1-tree  is  not  2,  and  a  maximun-cost  edge  (i,j)  among  those  incident 
with  the  chosen  node.  Then  generate  two  new  subproblems  defined  by 

(30) 

\  *  V2 

This  rule  generates  only  two  successors  of  each  node  k  of  the  search 
tree,  but  the  minimum  1-tree  in  subproblem  k  remains  feasible  for  subproblem 
k  +  2. 

ER  8,  (Volgenant  and  Jonker  [1982]),  Choose  a  node  p  whose  degree  in 
the  current  l-tree  exceeds  2.  Such  a  node  is  incident  with  at  least  two  free 
edges,  say  (ij^ijj^)  and  (12^2)  (otherwise  contains  two  edges  incident 

with  p,  hence  the  remaining  edges  incident  with  p  belong  to  or  should  belong 
to  £,  ).  Generate  three  new  subproblems  defined  by 

IS 


\+i  ■  \  ■ 

\+l  i 

(31) 

Ik+2 

V3  • 

If  p  is  incident  with  an  edge  in  then  node  k+1  is  not  generated. 

This  rule  also  partitions  the  feasible  set  and  makes  the  1-tree  at  node  k 
infeasible  for  each  of  the  successor  nodes,  while  the  number  of  successors  of 


each  node  is  at  most  3 
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Other  features 

Held  and  Karp  [1971]  and  Smith  and  Thompson  [1977]  use  a  depth  first  sub> 
problem  selection  rule,  while  Volgenant  and  Jonker  [1982]  have  Implemented  both 
a  depth  first  and  a  breadth  first  rule,  with  computational  results  that  Indi¬ 
cate  a  slight  advantage  for  the  depth  first  rule  (in  their  Implementation)* 

Extension  to  the  asymmetric  TSP 

The  basic  Ideas  of  the  1-tree  relaxation  of  the  synmetrlc  TSP  carry 
over  to  the  asymnetrlc  case  (Held  and  Karp  [1970]),  In  that  the  1-tree  In 
an  undirected  graph  can  be  replaced  by  a  1-arborescence  In  the  directed  graph 
6  ■  (V,A),  defined  as  an  arborescence  (directed  tree)  rooted  at  node  1, 
plus  an  arc  (1,1)  Joining  some  node  l€A[l}  to  node  1*  The  constraints 
defining  a  1 -arborescence,  namely  (4)  and 


(32) 

1€SJ€V\S  •' 

¥  S  ^  V  :  (136S 

(33) 

r  r  X  -  n 

1€VJ€V 

(34) 

r  X  - 1 

1€V 

are  easily  seen  to  be  a  relaxation  of  the  constraint  set  (2),  (4),  (5)  of 
the  TSP. 

The  problem  of  finding  a  minimum-cost  1-arborescence  can  again  be  de¬ 
composed  Into  two  Independent  problems,  namely  (or)  finding  a  minimum-cost 
arborescence  In  G  rooted  at  node  1,  and  (8)  finding  a  minimum-cost  arc 

(1,1)  In  G.  Problem  (a)  can  be  solved  by  the  polynomial  time  algorithms 

2 

of  Edmonds  [1967]  or  Fulkerson  [1974],  or  by  the  0(n  )-tlme  algorithm  of 
Tar Jan  [1977], 
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To  obtain  the  Lagrangean  version  of  the  1-arborescence  relaxation,  one 
forms  the  function 

(35)  L(X)  -  min  2  S  c  ,x  +  2  \  (  S  x  ,  -  1) 

x€X(G)  i€VJ€V  i€V  j€V 

■  min  2  2  (c  i  +  Ji-i )  *i »  "  S  X .  , 
xGt(G)  i€VJ€V  ^  J  i€V 

where  X(G)  is  the  set  of  incidence  vectors  of  C,  the  family  of  1-arbo- 
rescences  in  G.  Again,  the  strongest  lower  bound  on  v(TSP)  is  of  course 
given  by  X  ■  X  such  that 

(36)  L(X)  =  max  L(X)  , 

X 

and  subgradient  optimization  can  be  used  to  solve  problem  (36),  However, 
computational  experience  with  this  relaxation  (see  Smith  [1975])  shows  it  to 
be  inferior  (for  asymmetric  problems)  to  the  AP  relaxation,  even  when  the 
latter  uses  the  original  objective  f\mction  of  the  TSP, 

4.  Relaxation  III:  the  Assignment  Problem  with  Lagrangean  Objective  Function 
This  relaxation  was  used  for  the  asymmetric  TSP  by  Balas  and  Christofides 
[1981],  It  is  a  considerable  strengthening  of  the  relaxation  consisting 
of  the  AP  with  the  original  cost  function,  involving  a  significant  computa¬ 
tional  effort,  which  however  seems  amply  justified  by  the  computational 
results  that  show  this  approach  to  be  the  fastest  currently  available  method 
for  this  class  of  problems. 

Consider  the  asymmetric  TSP  defined  on  the  complete  directed  graph  G  *  (V,A), 
in  the  integer  programming  formulation  (1),  (2),  (4),  plus  the  subtour-elimination 
constraints.  The  latter  can  be  'written  either  as  (3)  or  as  (5),  but  for  reasons 
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CO  be  explained  later,  we  include  both  (3)  and  (3),  as  well  as  some  positive 
linear  combinations  of  such  inequalities,  and  write  the  resulting  sec  of  subcour- 
elimination  inequalities  in  the  generic  form 


Z  Z  aj.x,,  >  a^ 
i€Vj€V  ° 


t  €  T  . 


Thus  our  integer  programming  formulation  of  Che  TSP  consists  of  (1),  (2),  (4) 
and  (37).  To  construct  a  Lagrangean  relaxation  of  TSP,  we  denote  by  X  Che  feasible 
sec  of  AP,  and  associate  a  multiplier  w^,  c€t,  with  every  inequality  in  the  system 

(37).  We  then  have 


(38)  L(w)  «  min  Z  Z  ^  ^  w  (  Z  Z  a^  x. ,  -  a*^) 

xtXitVj-?;  •'  J  t€T  i€Vj€V 

-  min  Z  2  (c  .  -  Z  w  aj  )x  +  Z  w  a^  , 
xeCi€Vj€'/  ^  t€T  ^  t€T  ^ 

where  w  «  (w^).  Clearly,  the  strongest  such  relaxation  is  given  by  w  -  w  such 


(39)  L(w)  ■  max  L(w) 

w^ 

The  Lagrangean  dual  (39)  of  the  TSP  could  be  solved  by  subgradient  optimi¬ 
zation,  like  in  the  case  of  the  1-tree  relaxation  of  the  symmetric  TSP.  However, 
in  this  case  Che  vector  w  of  multipliers  has  an  exponential  number  of  compo¬ 
nents,  and  until  an  efficient  way  is  found  to  identify  the  components  that  need 
to  be  changed  at  every  iteration,  such  a  procedure  seems  computationally 
expensive,  Balas  and  Christofides  [1981]  therefore  replace  (39)  by  the 
"restricted”  Lagrangean  dual 

(40)  max  L(w)  , 
w6^ 

where 
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w  -  <r 


W 


w  >  0  and  there  exists  u,v6R  such  that 

1 


I 


l-'lj  *13 

■  <  If  .  0 


and  X  Is  the  optimal  solution  found  for  the  AP. 

In  other  words,  (40)  restricts  the  multipliers  to  values  that, 

together  with  appropriate  values  ^  feasible  solution  to  the 

dual  of  the  linear  program  given  by  (1),  (2),  (37)  and  x^^  >  0  ,  i,j£V’. 

This  may  cause  the  value  of  (40)  to  be  less  than  that  of  (39),  but  it 

leaves  the  optimal  solution  x  to  AP,  also  optimal  for  the  objective  function 

(38),  Thus  (40)  can  be  solved  without  changing  x.  While  no  good 

method  is  known  for  the  exact  solution  of  (40),  Balas  and  Christofides  [1981] 

give  a  polynomially  bounded  sequential  noniterative  approximation  procedure, 

A  A 

which  yields  multipliers  w^  such  that  L(w)  typically  comes  close  to  v(TSP): 
for  randomly  generated  asymmetric  TSP's,  L(w)  was  found  to  be  on  the  average 
99.5%  of  v(TSP)  (Christofides  [1979],  p.  139-140). 

The  procedure  starts  by  solving  AP  for  the  costs  c^j,  ¥  i,j,  and  taking  u^^,  v^ 
to  be  the  components  of  the  optimal  solution  to  the  dual  of  AP.  It  then  assigns 
values  to  the  multipliers  sequentially,  without  changing  the  values  assigned 
earlier.  We  say  that  an  inequality  (37)  admits  a  positive  multiplier,  if  there 
exists  a  w^  >  0  which,  together  with  the  multipliers  already  chosen,  satisfies 
the  constraints  of  W.  At  any  stage,  v(TSP)  is  bounded  from  below  by 


(41)  I  u,  +  I  V,  +  Z  w  a° 
^  jtV  ^  t€T  ^ 


i€y 


since  (u,v,w)  is  a  feasible  solution  to  the  dual  of  the  linear  program  defined 
by  il),  (2),  (37)  and  x  >  0,  ¥  i,j. 
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The  bounding  procedure  successively  identifies  valid  inequalities  that 
(i)  are  violated  by  the  AP  solution  x  ,  and 
(ii)  admit  a  positive  multiplier. 

Such  inequalities  are  included  into  L(w)  in  the  order  in  which  they 
are  found,  with  the  largest  possible  multiplier  w^.  The  inclusion  of  each 
new  inequality  strengthens  the  lower  bound  L(w).  V7e  denote  by  the  re¬ 

duced  costs  defined  by  the  optimal  dual  variables  u^  ,  v^  and  the  multipliers 

"t-  =1]  ■  'i]  -  "i  ■ 

At  any  given  stage,  the  admissible  graph  spanning 

subgraph  of  G  containing  those  arcs  with  zero  reduced  cost,  i.e. 


A 


-  (l,j)  €  aIu  +  V  +  I  w  aj 

'  ^  tST  ^ 


where  T  is  the  index  set  of  the  inequalities  Included  so  far  in  L(w).  The 

inclusion  of  each  new  inequality  into  the  Lagrangean  function  adds  at  least  one 

new  arc  to  the  set  A^.  Furthermore,  as  long  as  G^  is  not  strongly  connected,  the 

procedure  is  guaranteed  to  find  a  valid  inequality  satisfying  (i)  and  (ii).  Thus 

the  number  of  arcs  in  A^  steadily  grows;  and  when  no  more  inequalities  can  be 

found  that  satisfy  (i)  and  (ii) ,  is  strongly  connected.  Finally,  if  at  some 

point  G  becomes  Hamiltonian  and  a  tour  H  is  found  in  G  whose  incidence 
o  o 

vector  satisfies  (37)  with  equality  for  all  tcT  such  that  w^  >  0,  then  H  is  an 
optimal  solution  to  TSP  (see  Exercise  5). 

Three  types  of  inequalities.  Indexed  by  .  T2  and  T^  ,  respectively, 
are  used  in  three  noniterative  bounding  procedures  applied  in  sequence.  We 
will  denote  the  three  components  of  w  corresponding  to  these  three  inequality 
classes,  by  ).  -  "  ^\^i€T 


,  respectively 
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Bounding  procedure  1 

This  procedure  uses  the  inequalities  (5)  satisfying  conditions  (i)  and  (il). 
For  any  SCV,  the  set  of  arcs  (S,V\S)  ■  i(i» j)€Aji€S,  j€V'S}  is  called  a  directed 
cutset.  The  inequalities  (5)  corresponding  to  the  node  sets  S^,  t€T,  can  be 
represented  in  terms  of  the  directed  cutsets  «  (S^,  V\S^),  as 


“  *ij  - 


t  €  T^  . 


At  any  stage  of  the  procedure,  the  inequality  corresponding  to  cutset 
is  easily  seen  to  satisfy  conditions  (1)  and  (li)  if  and  only  if 

(43)  n  A  »  0  . 

t  o 

To  find  a  cutset  satisfying  (43),  one  chooses  a  node  i  €  V  and 
forms  its  reachable  set  R(i)  «  fjcVj there  is  a  directed  path  from  1  to  j}  in 
Gg.  If  R(i)  ■  V,  there  is  no  cutset  with  1  ^  satisfying  (43),  so  one  chooses 
another  node.  If  R(i)  *  V  for  some  i  €  7,  then  K,  -  (R(i),  V\R(i))  satisfies  (43), 

W 

and  the  largest  value  that  one  can  assign  to  the  corresponding  multiplier  with¬ 
out  violating  the  constraints  of  W  is  X  »  min  c  .  Thus  the  inequality 

(i,j)tK|. 

(42)  corresponding  to  is  included  in  L(w)  by  setting  the  reduced  costs  to 

c, ,  c. ,  -  (i,1)€K^,  c, ,  *-  c, ,  otherwise.  This  adds  to  A  all  arcs  for 

ij  ij  t’  t*  IJ  ij  o 

which  the  minimum  in  the  definition  of  X^  is  attained.  The  search  is  then  started 
again  for  a  new  cutset;  and  the  procedure  ends  when  the  reachable  set  of  every 
node  is  V.  .\t  that  stage  is  strongly  connected,  and  KflA^  #  0  for  all 

directed  cutsets  K  in  6.  Also,  from  (41)  and  the  fact  that  a^  <■  1,  ¥  t€T^,  it 
follows  thac  procedure  1  improves  the  lower  bound  on  v(TSP)  by  at 

the  end  of  procedure  1  the  lower  bound  is 


B.  -  v(AP)  +  Z  X,.. 
^  t£T 

1 


One  can  show  that  bounding  procedure  1  generates  at  most  (h-1) (h’f2)/2 
cutsets,  where  h  Is  the  number  of  subtours  In  x  (see  Exercise  6).  The  computa¬ 
tional  effort  required  to  find  a  cutset  satisfying  (43)  or  showing  that  none 
exists  Is  0(n|A|). 

Example  2. 

Consider  the  8-clty  ISP  whose  cost  matrix  Is  shown  In  Table  1. 


Table  1 
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10 

11 

12 

10 

9 

12 

X 

3 

8 

7 

10 

10 

10 

6 

3 

1 

X 

Table  2  shows  the  optimal  solution  x  to  AP  (x.  ^  =  1  for  (l,j)  boxed  in, 

Xj^j  *  0  otherwise),  the  optimal  solution  (u,v)  to  the  dual  of  AP  (the  numbers 
on  the  rim),  and  the  reduced  costs  c^^.  The  solution  value  Is  17.  The  correspon 
ding  admissible  graph  Is  shown  In  Fig.  3. 

Bounding  procedure  1.  Cutset  ■  ({l,  2,  3,  7,  8},  [4,  5,  6})  admits 
^l  “  C8  6  *  2,  and  cutset  ■  ((4,  5,  6},  [1,  2,  3,  7,  8})  admits  ^2  *  ^5  3  "  3. 
The  lower  bound  becomes  17  +  2  +  3  >  22.  The  new  reduced  cost  matrix  Is  shown 


In  Table  3  and  the  corresponding  admissible  graph  In  Fig.  4.  Note  that  G 
of  Fig.  4  Is  strongly  connected. |l 
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Table  3 
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Fig.  4.  Graph  G  after  bounding  procedure  1 
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Bounding  procedure  2 

This  procedure  uses  the  inequalities  (3)  that  satisfy  conditions  (i)  and 
(ii),  i.e,  are  violated  by  x  and  admit  a  positive  multiplier.  To  write  these 
ineq\ialities  in  the  general  form  (37),  we  restate  them  as 

(44)  -  r  S  *1,  >1  -  |S,|.  t«i  . 

i€S^j€S^  ^ 

The  subtour  elimination  inequalities  (3)  (or  (44))  are  known  to  be 
equivalent  to  (5)  (or  (42)).  Nevertheless,  an  inequality  (44)  may  admit  a 
positive  multiplier  when  the  corresponding  inequality  (42)  does  not,  and  vice 
versa. 

If  are  the  node  sets  of  the  h  subtours  of  x,  every  in¬ 

equality  (44)  defined  by  S^,  t»l,,.,,h,  is  violated  by  x;  but  a  positive 
'  multiplier  can  be  applied  without  violating  the  condition  that  x^^  ■  1 
implies  c^j  ■  0,  only  by  changing  the  values  of  some  u^^  and  v^  ,  and  this 
in  turn  can  only  be  done  if  a  certain  condition  is  satisfied.  Roughly  speaking, 
we  have  to  find  a  set  of  rows  I  and  columns  J  such  that,  by  adding  to  each  u^,  l€l 
and  V,,  j€j  the  same  amount  la.,.  >  0  that  is  being  added  to  c,  . ,  (i,j)€(S  ,  S  ), 

J  c  IJ  C  t 

we  obtain  a  new  set  of  reduced  costs  c^^  such  that  c^^  >  0  for  all  (i,j),  and 
c^j  ■  0  for  all  those  (i,j)  such  that  x^j  ■  1.  The  condition  for  this  Is  best 
expressed  in  terms  of  the  assignment  tableau  of  the  Hungarian  algorithm  vhose 
rows  and  columns  are  called  lines,  and  whose  row/column  intersections  are  called 
cells.  Cells  correspond  to  arcs  of  G  and  are  denoted  the  same  xny. 

Let  be  the  node  set  of  a  subtour  of  x,  and 


A,  «  {(i.j)€Aji,j€S^},  a;  -  C(iJ)€Ajx^j  -  1] 
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Theorem  2  (Bales  and  Christofides  [1981]).  Inequality  (44)  admits  a 

positive  multiplier  if  and  only  if  there  exists  a  set  C  of  lines  such  that 

(a)  every  (i,J)€A'  is  covered  by  exactly  one  line  in  C  , 

(8)  every  (1,J)€A^NA^^  is  covered  by  at  most  one  line  in  C  , 
t  t 

(y)  no  (i,j)€Ag\A^  is  covered  by  any  line  in  C  . 

If  such  a  set  C  exists,  and  it  consists  of  row  set  I  and  column  set  J, 
then  the  maximum  applicable  multiplier  is 


min  c. 


where 


M  -  (i,j)',a,v  sj  v(v\s^,j)  . 


Proof.  Sufficiency.  Suppose  line  set  C,  consisting  of  row  set  I  and 
column  set  J,  satisfies  (a),  (8),  (y)  •  Then’ adding  >  0  to  c^j  for  all 

(i,j)€(S^,  S^),  as  well  as  to  all  u^^,  t6l  and  v^,  j€j,  produces  a  set  of  reduced 

costs  c^^  such  that  c^j  ■>  0  for  (i,j)^g»  since  C  ■  lUJ  satisfies  (a).  Further, 
since  C  satisfies  (3)  and  (y),  >  c^^  »  0  for  all  (i,j)^^\A^,  and  c^'j  »  c^^^  «  0 

for  all  (i,J)€A^'''Aj..  The  only  reduced  costs  chat  are  diminished  as  a  result 
of  the  above  changes,  are  those  corresponding  to  arcs  in  one  of  the  three  sets 
(I,J),  (I,V\Sj.),  (V\S^,  J)  whose  union  is  the  set  M  of  the  theorem.  Hence 

setting  Uj.  equal  to  the  minimum  reduced  cost  over  M  provides  a  positive  multiplier 

that  can  be  applied  to  the  arcs  in  (S^,  S^). 

Necessity.  Suppose  a  multiplier  p,  >  0  can  be  applied  to  the  arc  set  (S^,  S^). 
In  order  to  prevent  the  c^^j  for  (i,J)€A^  from  becoming  positive,  one  must  increase 


Ui  -v^ 


by  ^  for  all  (i,J)€A|^.  If  this  can  be  done,  it  can  be  done  by  adding  u 
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to  or  Vj  (but  not  to  both)  for  (t,J)€A^;  and  the  corresponding  index  sets  I 

and  J  form  a  set  C  «•  lUJ  that  satisfies  (a).  Let  C  be  the  collection  of  all 

sets  C  obtained  in  this  way.  Now  take  any  C€C.  If  C  violates  (3),  then 

c/,  ■  c  ,  +  n  “  2jj,  <  c.  ■  0  for  some  (i,J)€A  \A^,  and  if  it  violates  (Y),  then 
ij  ij  ij  t  c 

<  Cj^j  ■  0  for  some  (i, j)6A^\Aj..  Since  by  assumption  p  >  0  can  be  applied 
to  (S^,  S^),  there  exists  at  least  one  set  C€C  that  satisfies  both  (P)  and  (Y).ll 
To  check  whether  for  a  given  sub tour-node- set  there  exists  a  set  of 
lines  C  satisfying  conditions  (a),  O),  (y)»  we  proceed  as  follows. 

First  we  construct  a  set  R~  of  rows  that  cannot  belong  to  C,  and  a  set  K*** 
of  columns  that  must  belong  to  C,  if  conditions  (a),  (p),  (y)  are  to  be  satisfied. 
To  do  this,  we  start  with  K"*"  ■  P  and  in  view  of  (y),  put  into  R"  all  rows  i  for 
which  there  exists  a  cell  (i,J)€A^  with  j€V\S^.  Then  we  apply  recursively  the 
following  two  steps,  until  no  more  additions  can  be  made  to  either  set: 

If  a  row  i  was  put  into  R",  then  to  satisfy  (a)  we  put  into  K"*"  every  column 
J  such  that  (i,j)6A'. 

If  a  column  J  was  put  into  then  to  satisfy  (P)  we  put  into  R*  every 

row  h  such  that  (h,j)6A^. 

To  state  the  procedure  formally,  we  set  K'*'  *  0, 

o 

R^  -  U€sja(i,j)^^  with  j€v\s^}, 
and  define  recursively  for  r  >  l,...,r. 

Here  r  is  the  smallest  r  for  which  k'*’  *  ,  or  R"  ■  R*  , . 

r  r-1  r  r-1 
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Next  we  uie  a  perfectly  analogous  procedure  to  construct  a  set  of  rows 
that  must  belong  to  C  and  a  set  k"  of  columns  that  cannot  belong  to  C,  If  (a), 
(3).  (Y)  are  to  hold.  In  other  words,  we  set  R^  ■  0, 

K"  «  {j€Sj.la(i,j)€A^  with  i€v\S^}, 
and  define  recursively  for  s  «  l,...,s, 

K'  -  k'.^U  (j€sJa(i,j)€A^  with  iCr"} 

Here  s  »  min{8^,  s^l,  where  Sj^  is  the  smallest  s  such  that  1 

K"  *  K  ,,  and  s.  is  the  smallest  s  such  that  R^nRljt0orK  nKij<0, 
s  s-1’  2  8  r  8  r 

If  8  «  s^,  then  some  row  or  some  column  that  cannot  belong  to  C,  must  belong 
to  C  for  (a),  (0),  (y)  to  hold;  hence  there  exists  no  set  C  of  lines  satisfying 
(a) I  (0)>  (y)i  and  no  positive  multiplier  can  be  applied  to  the  inequality  (44) 
corresponding  to  S^. 

If  8  «  s., ,  then  the  set  of  lines  C  »  lUJ,  where  1  *  SArI  and  J  »  Ici. 

i  t  r  r' 

satisfies  conditions  (a),  (0),  (y).  Thus  we  include  the  Inequality  (44)  corre¬ 
sponding  to  into  L(w)  with  the  multiplier  >  0  defined  in  Theorem  2,  and 
set  the  reduced  costs  to  c^^j  *-  c^^^  -  (i,j)6M,  c^^^  -  c^^j  otherwise.  (Here 

M  is  the  set  defined  in  Theorem  2.) 

In  both  cases,  we  then  choose  another  subtour,  until  all  subtours  have  been 
examined.  If  h  is  again  the  number  of  subtours,  bounding  procedure  2  requires 
0(h*|A|)  steps.  It  can  be  shown  (see  Exercise  7)  that  this  procedure  improves 
the  lower  bound  on  v(TSP)  by  E^P^i  i.e.,  at  the  end  of  procedure  2  the  lower 

B,  -  v(AP)  +  E  X,.  +  E  p,.  . 

^  tCT^  t€T2 


bound  is 
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Example  2  (continued) . 

Bounding  procedure  2.  The  subtoure  of  x  ere  (1,  2,  3),  (4,  5,  6)  and 
(7,  8) (see  Table  3  and  Fig.  4). 

For  »  {1,  2,  3},  -  {3},  -  {l};  k"  -  {3},  r|  -  {2}.  Thus  C  «  lUJ, 

where  I  ■  {1,  2},  J  •  {ij,  end  "  ®2  6  *  ^2  “  ^o  “ 

Kj  -  {4};  k’  -  t6},  R^  -  {S},  end  C  -  lUJ,  with  I  -  l4,  5},  J  -  (4},  and 
^^2  “  ®5,4  *  S3  -  {7,  8},  r'  -  (8},  x|  -  [7};  K"  -  (7).  and 

since  K^DKj^  »  {7}  0,  the  inequality  corresponding  to  subtour  (7,  8)  does  not 

admit  a  positive  multiplier. 

The  lower  bound  becomes  B2  ■  *  -2  +  1  +  2  *  25.  The  new  reduced 

costs  are  shown  in  Table  4,  and  the  corresponding  admissible  graph  in  Fig.  S.]] 

Table  4 
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3 

3 

2 

3 

2 

2 
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0 

2 
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0 

4 
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0 
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4 
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0  8 
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X 
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3 

0 

0 

X 

Fig.  5.  Graph  after  bounding  procedure  2 


Bounding  procedure  3 

The  class  of  inequalities  used  in  this  procedure  is  defined  as  follows. 
Suppose  has  an  articulation  point,  i.e.  a  node  k  such  that  G^  -  {k}  has 

more  than  one  component.  Let  one  of  the  components  have  node  set  S^,  and  denote 
»V\S^U{k}.  Then  every  tour  contains  an  arc  of  at  least  one  of  the  cutsets 
•  (S^,W^)  and  hence  the  incidence  vector  x  of  any  tour 

satisfies  the  inequality 


(45) 


S  X  >  1  . 

(i,j)€K'j.UK'^' 


Furthermore,  (45)  satiffies  condition  (i),  i.e,  is  violated  by  the  AP  solution 


Bounding  procedure  3  uses  those  inequelltles  (45)  thet  also  satisfy 
condition  (11) .  Although  every  Inequality  (45)  Is  the  combination  of  some 
Inequalities  (3)  and  equations  (2)  (see  Exercise  8),  levertheless  It  is  possible 
to  find  Inequalities  (45)  that  satisfy  condition  (11),  l.e.,  admit  a  positive 
multiplier,  when  no  Inequality  (3)  (l.e.,  (44))  satisfies  It.  Indeed,  It  is  not 
hard  to  see,  that  If  k  Is  an  articulation  point  of  and  Is  the  node  set  of 
one  of  the  components  of  -  {k},  then  K^HA^  =  k'Vi  A^  =  0  and  a  positive 
multiplier  given  by 


(46) 


min  c  . 

(l,j)€K'^UK'^' 


can  be  applied  to  the  arc  set  .  On  the  other  hand,  if  G  has  no  artlcula- 

t  t  o 

/ 

tion  point,  then  for  any  choice  of  the  node  k,  the  minimum  in  (46)  is  0  and  thus 
no  Inequality  (45)  admits  a  positive  multiplier. 

Thus  bounding  procedure  3  checks  for  «*>»ery  iCV  whether  it  is  an  articulation 
point,  and  If  so.  It  takes  the  corresponding  inequality  (45)  into  L(w)  with  the 
multiplier  given  by  (46).  This  is  done  by  setting  c^^  »-  c^^  -  v^,  (1,  j)€K|.' UK"  , 
•“  c^j  otherwise.  Since  has  n  nodes,  and  testing  for  connectivity  requires 
O(jAj)  steps,  bounding  procedure  3  requires  O(njAj)  steps. 

In  view  of  (41)  and  the  fact  that  (45)  has  a  righthand  side  of  1,  at  the  end 
of  bounding  procedure  3  one  has  the  following  lower  bound  on  v(TSP); 

B,  »  v(AP)  +  E\  +Zp,  +  Zv. 

t€Tj^^  t€T2  '  ^ 


Example  2  (continued) . 

Vertex  6  is  an  articulation  point  of  G^  (see  Fig.  5).  The  corresponding 
cutsets  are  k'  -  ((4,  5},  {l,  2,  3,  7,  8})  and  K " «  ((1,  2,  3,  7,  8},  (4,  5}), 
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Additional  bounding  procedures 

At  the  end  of  bounding  procedure  3,  Is  strongly  connected  and  wlthoL-t 

articulation  points.  At  that  stage  an  attempt  Is  made  to  find  a  tour  In  G 

o* 

For  that  purpose  a  specialized  Implicit  enumeration  technique  Is  applied, 
with  a  cut-off  rule.  If  a  tour  H  Is  found  whose  Incidence  vector  x  satis¬ 
fies  with  equality  all  those  Inequalities  (37)  such  that  w^.  >  0,  then  S  la 

optimal  for  the  current  subproblem  (this  follows  from  elementary  Langrangean 
theory). 

EMmpU  2  (continued).  The  following  tour  cen  be  Identified  by  Inspection 
in  of  Fig,  6:  H  -  (Cl.  2).  (2.  3).  (3,  7), (7,  8),  (8,  6),  (6,  A),  (4,  5),  (5.  1)}, 
The  velue  of  H  1.  26,  e,uel  to  L(w)  .  Bj,  the  lower  bound  et  the  end  of  procedure  3. 
The  tour  H  contelns  erectly  one  ere  of  eech  cutset  essoclsted  with  e  positive 
nenely  ere  (8.  6)  of  -  ((1.  2.  3,  7.  8),  [4.  5.  6)),  end  .rc  (5,  1)  of 

*^2  (Tf  2,  3,  7,  8}),  Thus  the  Incidence  vector  of  H  setisfles 

with  eguellty  the  two  loequelltles  (42)  corresponding  to  end  ss  required. 
Further.  H  contelns  erectly  IsJ  -1.2  .res  of  the  subtour  with  node  set 
Sj  -  {1.  2,  3),  nenely.  (1,  2)  end  (2.  3);  end  erectly  -1.2  sres  of  the 
subtour  with  node  set  -  (4.  5.  6).  nnsely  (6,  4)  end  (4,  5).  Thus  the 
conplemeotstlty  condition  Is  elso  setlsfled  for  the  two  Inequslltles  (44) 
corresponding  to  Sj  snd  S^.  Flnslly.  It  contelns  erectly  one  ere  of  the  set 
Kj'UK'',  where  -  ((4,  5),  (l,  2.  3,  7,  8)),  ((1,  2.  3,  7.  8),  [4,  5)). 

Mccly  (5.  1):  so  the  conplenenterlty  condition  slso  holds  for  the  Inequellty 
(45)  corresponding  to  Kj'Uk"  .  m  conclusion.  H  is  optlnsl.H 

If,  sfter  bounding  procedure  3,  s  tour  H  Is  found  ouch  thet  r  vlolsteo  this 
conplenenterlty  condition  for  some  t  €  T,  then  ettenpts  ere  node  to  replsco  those 
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inequalities  (37)  that  are  "out  of  kilter,"  i,e.,  for  which  the  complementarity 
condition  is  violated,  by  "in  kilter"  inequalities  (of  the  same  type),  i.e., 
inequalities  that  are  tight  for  x  and  thus  admit  positive  multipliers  satisfying 
the  complementarity  condition.  These  attempts  consist  of  a  sequence  of  three 
additional  bounding  procedures,  called  4,  5  and  6,  one  for  each  type  of  inequality 
(42),  (44)  and  (45),  respectively.  Bounding  procedure  4  takes  in  turn  each  in¬ 
equality  (42)  which  has  a  positive  multiplier  and  yet  is  slack  for  x,  and  per¬ 
forms  an  exhaustive  search  for  other  inequalities  of  type  (42)  that  could  replace 
the  inequality  in  question  (with  new  multipliers)  and  which  are  tight  for  x.  If 
the  search  is  successful,  the  in  kilter  inequalities  with  their  new  multipliers 
replace  the  out  of  kilter  inequality  and  one  proceeds  to  the  next  out  of  kilter 
inequality  of  type  (42) .  Procedures  5  and  6  perform  the  same  furxtion  for  out 
of  kilter  inequalities  of  type  (44)  and  (45),  respectively.  These  procedures  are 
described  in  detail  in  Balas  and  Christofides  [19811.  When  procedures  4,  5  and 
6  are  not  successful  in  replacing  all  out  of  kilter  inequalities  (and  thus  proving 
H  to  be  an  optimal  tour),  they  nevertheless  strengthen  the  lower  bound  on  v(TSP). 

Each  of  the  six  bounding  procedures  is  polynomially  bounded.  This  (worst 
case)  bound  is  0(n  )  for  procedure  1,  0(n  )  for  each  of  the  other  procedures. 
The  mean  times  are  considerably  shorter,  and  on  the  average  procedure  2  (the 
only  one  that  changes  the  dual  variables  u^,  v^)  takes  the  longest.  The 
general  algorithm  of  course  remains  valid  if  any  subset  of  the  six  bounding 
procedures  is  used  in  place  of  the  full  set,  but  computational  testing  indi¬ 
cates  that  using  all  6  procedures  is  more  efficient  (i.e.  results  in  smaller 
search  trees  and  shorter  overall  computing  times)  than  using  any  proper  subset. 


43 


Branching  rules  and  other  features 

Before  branching,  all  arcs  (l,j)  such  that  c^j  >  U  -  L(w)  are  deleted 
from  G,  This  "reduction"  has  removed  on  a  set  of  120  randomly  generated  problems 
(Balas  and  Christofides  [1981]),  on  the  average  96-977.  of  the  arcs  in  problems 
with  up  to  150  variables,  and  987.  in  problems  with  175-325  variables. 

The  AP  relaxation  with  Lagrangean  objective  function  can  of  course  be  used 
with  any  of  the  branching  rules  BRl  -  BR5  described  in  the  context  of  the  AP 
relaxation  with  objective  function  (1).  Balas  and  Christofides  [1981]  use  two 
rules  intermittently,  namely  BR3  (partitioning  on  the  basis  of  a  subtour  elimi¬ 
nation  Inequality  (3)),  and  another  rule  based  on  a  disjunction  from  a  condi¬ 
tional  bound,  introduced  earlier  in  the  context  of  set  covering  (Balas  [1980]). 
This  latter  rule  is  motivated  by  the  following  considerations. 

A 

Let  H  be  the  current  tour  and  x  its  Incidence  matrix.  Remove  from  L(w) 
all  those  inequalities  (37)  that  are  .slack  while  the  associated  multiplier  is 
positive.  Let  c^j  be  the  reduced  costs,  and  L(w)  the  lower  bound,  resulting 
from  this  removal. 

Theorem  3.  Let  SSH,  S  «  f(i  j  ) . (i  j  )}  be  such  that 

i  i  P  P 


(47)  I  a  >  U  -  L(w), 
r«l  r^r 

and  let  the  arc  sets  Q^c:a,  r  «  l,...,p,  satisfy 

(48)  .  <c  ,  (i,j)€A 

rl(i,j)€Q^  ^r^r 


rhen  every  solution  x  to  TSP  such  that  cx  -r.  U  satisfies  the  disjunction 


V  (X  -  0,  (i,j)€Q  ). 
r»l  ^ 


44 


Proof.  L(w)  is  the  value  of  an  optimal  solution  to  the  dual  of  the  linear 
program  LP  defined  by  (1),  (2),  >  0,  (i,j)6A,  and  those  inequalities  (37) 

with  a  positive  multiplier.  Now  let  x  be  a  feasible  solution  to  LP  that  violates 

(49) .  Then  x  satisfies 

(50)  SmUx  >1  ,  r-l,...,p. 


Let  LP_|_  be  the  linear  program  obtained  by  adding  to  LP  the  constraints  (50). 
From  (48),  if  we  assign  the  values  ,  ,  r  *  l,...,p  to  the  dual  variables 


r'r 


associated  with  the  inequalities  (50),  we  obtain  a  feasible  solution  to  the  dual 

P  ^ 

of  LP_|_.  But  then  the  objective  function  value  of  this  solution  is  L(w)  ^  < 

r»l  ^r^r 

and  hence  from  (47) 


P  . 

cx  >  L(w)  +  E  i  >  U. 

r*l  ^r'^r 

Thus  every  solution  x  to  TSP  such  that  cx  <  U  satisfies  (49). |1 
The  branching  rule  can  now  be  stated  as  follows. 

BR9.  Choose  a  minimum- cardinality  set  SCH,  S  =  {(ij^,  jj^) . (ip,  jp)}, 

satisfying  (47).  Next  construct  a  px|A|  0-1  matrix  D  «  (where  r  is  the 

row  index  and  (i,j)  the  column  index),  with  as  many  I's  in  each  column  as  possible 
subject  to  the  condition  (48)  and  (i^,  j^)^Q^,  r  «  l,...,p,  where 

Qr  *  =  !}• 

Generate  the  p  new  subproblems  defined  by  the  disjunction  (49),  \^ere 
th 

the  r-  subproblem  is  given  by 

\+r-\UQ^  I 

(51) 

} 


^+r  “  ^ 


f  *!,•••, p 
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The  branching  rule  BR9  is  used  interaittently  with  BR3  becaiise  at  different 
nodes  the  ranking  of  the  two  rules  (in  terms  of  strength)  may  be  different.  The 
choice  is  based  on  certain  indicators  of  relative  strength. 

As  to  subproblem  selection,  the  Balas-Christofides  algorithm  uses  a  mixtu^'e 
of  depth  first  and  breadth  first:  a  successor  of  the  current  node  is  selected 
whenever  available;  otherwise  the  algorithm  chooses  a  node  k  that  minimizes 
the  function 

E(k) 

where  L(w)j^  is  the  value  of  L(w)  at  node  k,  v(AP)  is  the  value  of  the  initial  AP 
while  3(0)  and  s(k)  are  the  number  of  subtours  in  the  solutions  to  the  initial 
AP  and  the  one  at  node  k,  respectively, 

5.  Other  Relaxations 

For  the  same  reasons  as  in  the  case  of  the  AP  relaxation  with  the  original 
objective  function,  the  AP  relaxation  with  the  Lagrangean  objective  function  is 
inefficient  (weak)  in  the  case  of  the  symmetric  TSP.  Limited  computational 
experience  indicates  that  on  the  average  the  bound  L(w)  attains  about  96% 
of  v(TSP),  vhtch  compares  unfavorably  with  the  bound  obtained  from  the  1-tree 
relaxation. 

On  the  other  hand,  the  main  reason  for  the  weak  performance  of  AP-based 
relaxations  in  the  case  of  symmetric  problems,  namely  the  high  frequency  of 
subtours  of  length  2  in  the  optimal  AP  solution,  can  be  eliminated  if  AP  is 
replaced  by  the  2-matching  problem  in  the  undirected  graph  G  ■  (V,E), 


X*. 
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The  2-natchlng  relaxation 

The  problem  of  minimizing  the  function  (6)  subject  to  constraints  (7)  and 

(9)  is  known  in  the  literature  as  the  2-matching  problem,  and  is  obviously 
a  relaxation  of  the  ISP.  Bellmore  and  Malone  [1971]  have  used  it  for  the 
symmetric  TSP  in  a  way  that  parallels  their  use  of  the  AP-relaxation  for  the 
asymmetric  TSP.  A  2-matching  is  either  a  tour  or  a  collection  of  subtours, 
and  the  branching  rules  BR2  -  3R5  based  on  the  subtour-elimination  inequalities 
(3)  and  (5)  for  the  asymmetric  TSP  have  their  exact  parallels  in  branching  rules 
based  on  the  subtour  elimination  inequalities  (8)  and  (10)  for  the  symmetric  TSP 

The  objective  function  (6)  can  be  replaced,  just  like  in  the  case  of  the 
A?  relaxation,  with  a  Lagrangean  function  using  the  inequalities  (S)  and/or 

(10) ,  The  Lagrangean  dual  of  the  TSP  formulated  in  this  way  is  as  hard  to 
Solve  exactly  as  in  the  asymmetric  case,  but  it  can  be  approximated  by  a  pro¬ 
cedure  similar  to  the  one  used  by  Balas  and  Chris tof ides  [1981]  with  the 
AP-relaxation,  Further  facet  defining  inequalities,  beyond  (8)  and  (10), 
based  on  the  work  of  Grotschel  and  Padberg  [1979],  can  be  used  to  enrich  the 
set  drawn  upon  in  constructing  the  Lagrangean  function. 

Although  the  2-matching  problem  is  polynomially  solvable  (Edmonds  [1965])^ 
the  main  impediment  in  the  development  of  an  efficient  branch  and  bound  proce¬ 
dure  based  on  the  2-matching  relaxation  has  so  far  been  the  absence  of  a  good 
implementation  of  a  weighted  2-matching  algorithm.  However,  as  this  difficulty 
is  likely  to  be  overcome  soon,  the  2-matching  relaxation  wich  a  Lagrangean 
objective  function  will  in  all  likelihood  provide  bounds  for  the  symmetric  TSP 
comparable  to  those  obtained  from  the  1-tree  relaxation. 
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The  n-path  relaxation 

The  problem  of  minimizing  (1)  subject  to  the  constraint  that  the  solution 
X  be  the  incidence  matrix  of  a  directed  n-path  starting  and  ending  at  a  fixed 
node  V  (where  "path"  is  used  in  the  sense  of  walk,  i.e.,  with  possible  repeti¬ 
tions  of  nodes,  and  n  denotes  the  length  of  the  path)  is  clearly  a  relaxation 
of  the  TSP.  An  analogous  relaxation  of  the  symmetric  TSP  can  be  formulated  in 
terms  of  n-paths  in  the  associated  undirected  graph.  Furthermore,  the  constraints 
(2)  in  the  asymmetric  case,  or  (7)  in  the  symmetric  case,  can  be  used  to  replace 
the  objective  function  (1)  or  (6),  respectively,  by  a  Lagrangean  function  of  the 
same  type  as  the  one  used  with  the  1-arborescence  and  1-tree  relaxations.  This 
family  of  relaxations  of  the  TSP  was  introduced  by  Houck,  Picard,  Queyranne  and 
Vamuganti  fl9'^7].  The  (directed  or  undirected)  n-path  problems  involved  in  this 
relaxation  can  be  solved  by  a  dynamic  programming  recursion  in  0(n  )  steps. 
Computational  experience  with  this  approach  seems  to  indicate  (Christofides  [1979], 
p,  142)  chat  the  quality  of  the  bound  obtained  is  comparable  to  the  one  obtained  from 
the  l-arborescence  relaxation  in  the  asymmetric  case,  but  slightly  weaker  than 
the  bound  obtained  from  the  1-cree  relaxation  in  the  symmetric  case.  Since 
solving  the  1-tree  and  l-arborescence  problems  is  computationally  cheaper  than 
solving  the  corresponding  n-path  problems,  this  latter  relaxation  seems  to  be 
dominated  (for  the  case  of  the  "pure"  TSP)  by  the  1-tree  or  l-arborescence  re¬ 
laxation.  However,  the  n-path  relaxation  can  easily  accomodate  extra  condi¬ 
tions  which  the  1-tree  and  l-arborescence  relaxations  cannot,  and  which  often 
occur  in  problems  closely  related  to  the  TSP  (traveling  salesman  problems  with 
side  constraints  appear  in  vehicle  routing  (see  Chapter  12  of  this  book)  and 
other  practical  contexts.) 
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A  substantial  generalization  of  the  n-path  relaxation,  due  to  Chrlstofldes , 
Mlngozzl  and  Toth  [1981]  and  called  state-space  relaxation,  has  the  same 
desirable  characteristics  of  being  able  to  easily  accommodate  side  constraints. 

The  LP  with  cutting  planes  as  a  relaxation 

Excellent  computational  results  have  been  obtained  recently  by  Crowder 
and  Padberg  [1980]  for  the  symmetric  TSP  by  a  cutting  plane/branch  and  bound 
approach.  It  applies  the  primal  simplex  method  to  the  linear  program  defined 
by  (6),  (7),  x.j  >  0,  Vi,J,  and  an  unspecified  subset  of  the  inequalities 
defining  the  convex  hull  of  incidence  vecto'*s  of  tours,  generated  as  needed 
to  avoid  fractional  pivots.  The  procedure  uses  mostly  inequalities  of  the 
form  (10),  but  also  other  facet  inducing  inequalities  from  among  those  intro- 

If 

duced  by  Grotschel  and  Padberg  [1979].  When  the  search  for  the  next  inequality 
needed  for  an  integer  pivot  falls,  the  procedure  branches.  Since  the  main 
feature  of  this  approach  is  the  identification  of  appropriate  inequalities  to 
be  added  to  the  linear  program  at  each  step,  it  is  being  reviewed  in  the 
chapter  on  cutting  plane  methods. 

6.  Performance  of  State  of  the  Art  Computer  Codes 
In  this  section  we  review  the  performance  of  some  state  of  the  arc  branch 
and  bound  codes  for  the  TSP,  by  comparing  and  analyzing  the  computational  results 
reported  by  the  authors  of  these  codes. 

The  asymmetric  TSP 

The  three  fastest  currently  available  computer  codes  for  the  asymmetric  TSP 
seem  to  be  those  of  Balas  and  Chrlstofldes  [1981],  Carpaneto  and  Toth  [1580] 
and  Smith,  Srinivasan  and  Thompson  [1977],  to  be  designated  in  the  following  by 
3C,  CT  and  3ST,  respectively.  The  main  characteristics  of  these  codes  are  summa- 
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rlzed  in  Table  6<  Table  7  describes  the  computational  results  reported  by  the 
authors  of  the  codes.  Each  of  the  codes  was  run  on  a  set  of  (different)  asymmet¬ 
ric  TSP's  whose  costs  were  Independently  drawn  from  a  uniform  distribution  of  the 
Integers  in  the  Interval  [1,1000].  The  entries  of  the  table  represent  averages 
for  5  problems  (SST),  20  problems  (CT)  and  10  problems  (BC),  respectively,  in 
each  class.  The  number  of  nodes  in  the  SST  column  is  not  strictly  comparable 
with  that  In  the  CT  and  BC  columns,  since  It  Is  based  on  counting  only  those 
nodes  that  were  selected  for  branching  and  processed.  Also,  the  computing  times 
are  not  strictly  comparable  without  a  correction,  since  the  CDC  7600  Is  about  3 
times  faster  than  the  UNIVAC  1108  and  the  CDC  6600  (Computer  Review.  GML  Corp., 
Lexington,  MA,  1979).  The  picture  that  emerges,  however,  by  comparing  the 
figures  within  each  colimm,  for  any  of  these  three  codes.  Is  a  pattern  of  growth 
In  computational  effort  with  problem  size,  that  seems  rather  modest  for  a  problem 
usually  viewed  as  "notoriously  Intractable",  We  will  discuss  the  functional 
relationship  between  problem  slz^  and  computational  effort  In  some  detail  further 
below. 

For  problems  In  the  range  40  <  n  <  180,  the  number  of  nodes  generated  by 
the  BC  algorithm  Is  considerably  smaller  than  the  corresponding  numbers  for  the 
other  two  algorithms,  although  CT  uses  a  "breadth  first”  branching  strategy,  meant 
to  minimize  the  number  of  nodes  generated,  at  the  cost  of  Increased  storage 
requirements.  The  reason  for  this  Is  that  the  Lagrangean  bounding  function  used 
by  BC  changes  the  ranking  of  tours  among  the  assignments,  removing  from  considera¬ 
tion  many  assignment  problems  whose  value  In  terms  of  the  original  objective 
function  Is  higher  than  that  of  the  optimal  TSP,  and  which  therefore  must  be 
processed  by  the  CT  algorithm.  On  the  other  hand.  In  the  range  200  £  n  <  240, 
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Table  6.  Summary  description  of  three  codes  for  the  asymmetric  TSP 


Computing  time  (seconds) 


-■!  'M'  l.4.«Pii^ 
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BC  ueems  Co  generate  more  nodes  chan  CT;  Che  reason  for  this  may  be  chat  ac 
this  point  the  advantage  of  the  "breadth  first"  strategy  used  by  CT  outweighs 
chat  of  the  stronger  bounding  procedures  used  by  BC.  This  seems  to  suggest  chat 
an  algorithm  based  on  the  Lagrangean  bounding  procedures  of  BC,  but  using  the 
"breadth  first"  node  selection  strategy  of  CT,  will  generate  fewer  nodes  for 
any  problem  size,  Chan  either  the  CT  or  the  BC  algorithms.  This  is  undoubtedly 
true,  but  notice  that  at  the  current  state  of  the  art,  the  limiting  factor  in 
the  use  of  both  algorithms  is  not  computing  time  (which  has  never  exceeded  1.5 
minutes  for  any  problem),  but  (in  core)  storage  space. 

The  symmetric  TSP 

The  fastest  currently  available  branch  and  bound  codes  for  the  symmetric  TS? 
seem  to  be  those  of  Smith  and  Thompson  [1977]  and  Volgenant  and  Jcnker  [1982], 
to  be  designated  in  the  following  by  ST  and  VJ,  respectively.  Table  8  summa¬ 
rizes  their  main  characteristics,  while  Table  9  reports  on  their  computational 
performance. 

Again,  each  of  Che  two  codes  was  run  on  a  sec  of  (different)  symmetric 
TSF's  whose  costs  were  independently  drawn  from  a  uniform  distribution  of  the 
integers  in  the  interval  [1,1000] .  The  entries  of  the  cable  represent  averages 
for  15  problems  (except  for  n-80,  where  the  entry  for  SST  is  the  average  for 
5  problems  only).  The  CYBER  750  is  about  3  times  faster  than  the  UNIVAC  1108. 

Both  codes  were  also  tested  on  randomly  generated  symmetric  Euclidean 
TSP's,  which  required  for  each  code  a  greater  computational  effort  (e.g.,  for 
n  >  60  the  average  number  of  subgradient  iterations  was  3049  for  ST  and 
1034  for  VJ). 
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Table  8.  Summary  description  of  two  codes  for  the  symmetric  TSP 


'  f 

ST  VJ 

Relaxation 

1 

1 

l-tree  with  !  1-tree  with  ' 

Lagrangean  objective  ;  Lagrangean  objective 

Lower  bounding 

i 

i 

! 

1 

1  . 

"■  j 

subgradienC  1  subgradienc 

optimization  |  optimization  with 

» 

i  convex  combination 

j  of  subgradients 

Branching  rule 

> 

3R7  '  3R8  ■ 

t  1 

Subprcblsn  selection 

1 

depth  first  1  depth  first 

1 

Upper  bounding 

no  special  procedure 

no  special  procedure 

1  Variable  fixing 

no 

I 

yes  i 

! 

Table  9.  Computational  results  on  randomly  generated  symmetric  TSP's 


n 

Nodes  of  Che 
search  tree 

Subgradienc 

iterations 

■  - . - 

Computing  time 
(seconds) 

ST^l) 

Vj(2) 

ST 

VJ 

St(5) 

vj(^> 

i 

1 

■ 

50 

17 

526 

- 

22.1 

- 

60 

15 

572 

352 

34.1 

4.7 

1 

70 

19 

760 

- 

61.6 

- 

» 

1  SO 

15 

764 

702 

33.0 

15.5 

; 

i 

j  100 

• 

1664 

53.2 

i 

_J 

(1)  Number  of  nodes  chat  were  explored;  (2)  not  reported;  (3)  UNIVAC  1108; 
(4)  CYBER  75:. 
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Average  performance  as  a  function  of  problem  size 

IS?  is  well  known  to  be  NP-complete,  hence  In  all  likelihood  there  Is  no 
polynomial  time  TSP  algorithm,  l.e.  no  algorithm  guaranteed  to  solve  every  Instance 
of  TSP  In  a  number  of  steps  polynomial  in  n.  However,  this  statement  refers  to 
the  worst  case  behavior  of  algorithms,  and  does  not  exclude  the  existence  of 
algorithms  whose  performance,  though  exponential  in  the  worst  case,  is  on  the 
average  polynomial  in  n.  To  make  the  colloquial  term  "on  the  average"  more 
precise,  assume  the  coats  c^j  of  TSP  are  random  numbers  drawn  inde¬ 
pendently  from  a  uniform  distribution  over  the  unit  interval.  Whether  the 
expected  time  required  to  solve  such  a  problem  is  an  exponential  or  polynomial 
function  of  n.  Is  at  present  an  open  question,  on  which  the  opinion  of  experts 
is  divided  (see,  for  instance,  Bellmore  and  Malone  [1971],  and  Lenstra  and 
Rinnooy  Kan  [1978]). 

While  the  theoretical  issue  remains  unsolved,  it  is  not  Irrelevant  to 
examine  from  this  point  of  view  the  empirical  performance  of  some  of  the  more 
efficient  algorithms  on  randomly  generated  TSP's.  In  a  recent  study,  Balas, 

McGuire  and  Toth  [1983]  have  fitted  three  different  approximating  curves  to  the 
data  of  Table  7  for  each  of  the  three  codes  SST,  CT  and  BC  for  the  asymmetric 
TSP,  in  an  attempt  to  determine  which  of  the  three  types  of  functions  describes 
best  the  behavior  of  each  algorithm.  The  data  of  Table  7  were  corrected  for  the 
difference  in  speed  between  the  GDC  7600  and  the  other  two  computers  by  multiplying 
by  3  the  computing  times  reported  for  the  Balas -Chris tof ides  code.  The  functions 


examined  were: 

f(n) 

*  on^ 

(polynomial). 

f(n) 

- 

(superpolynomial 

f(n) 

“  0(6 

(exponential) , 

where  log  stands  for  the  natural  logarithm  and  e  for  its  base. 
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Each  of  the  three  functions  was  expressed  in  logarithmic  form,  and  a  simple 

regression  of  log  f(n)  was  run  on  log  n  (in  the  case  of  the  polynomial  function), 
2 

on  log  n  (in  the  case  of  the  super polynomial  function),  and  on  n  (in  case  of 
the  exponential  function),  in  order  to  find  the  best  fitting  values  of  c'  and  3 
for  each  case.  The  outcome  is  shown  in  Tables  10,  11  and  12. 

Table  10.  Statistical  analysis  of  the  Smith-Srinivasan-Thompson  algorithm 


50  <  n  <  180 


Type  of 
function 

Best  fit 

...  ...  . .  , 

Standard 
error  of 
estimation 

Coefficient  of 
determination 

Polynomial 

0.38  X  10'^  X 

0.505 

0.883 

Superpolynomial 

0.105  X  10"^  X 

0.519 

0.877 

Exponential 

1.19  X 

0.595 

0.838 

Table  11.  Statistical  analysis  of  the  Carpaneto-Toth  algorithm 


40  <  n  <  240 


Type  of 
function 

Best  fit 

Standard 
error  of 
estimation 

— 

Coefficient  of 
determination 

Polynomial 

0.26  X  lO"^  X 

0.193 

0.978 

Superpolynomial 

0.47  X  lo'*  X 

0.255 

0.962 

1  Exponential 

1.05  X 

0.488 

0.860 
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Table  12.  Statistical  analysis  of  the  Balas-Chrlstofldes  algorithm 

50  <  n  <  325 


Type  of 
function 

Best  fit 

1'  . —-I.  . . 

Standard 
error  of 
estimation 

Coefficient  of 
determination 

Polynomial 

0.5  X  10‘^  X 

0.361 

0.962 

Superpolynomial 

0.87  X  10’^  X 

0.260 

0.980 

Exponential 

0.85  X  10“^  X 

0.199 

0.989 

These  results  suggest  that  in  the  limited  range  of  n  for  which  the 
algorithms  were  tested  (40  <  n  <  180  for  SST,  40  <  n  <  240  for  CT,  and  50  <  n  <  325 
for  BC),  their  behavior  can  be  almost  equally  well  described  by  any  of  the  three 
types  of  functions  considered.  Although  the  rankings  given  by  the  coefficient  of 
determination  seem  to  be  polynomial/ superpolynomlal/exponential  for  SST  and  CT, 
versus  exponential/ superpolynomial/pol)momial  for  BC,  the  differences  between 
the  coefficients  of  determination  for  the  three  function  types  are  too  small  in 
comparison  to  the  differences  between  the  same  coefficients  for  the  different 
algorithms,  in  order  to  attach  much  significance  to  these  rankings.  Further  caution 
and  reservations  are  in  order  because  of  the  considerable  differences  in  the 
range  of  n  over  which  the  three  codes  were  tested. 

In  an  attempt  to  obtain  a  more  meaningful  ranking  of  the  three  types  of 
approximation  curves,  the  range  of  n  for  each  of  the  three  algorithms  was  then 
broken  up  into  two  approximately  equal  parts,  and  the  same  three  function  types 
were  fitted  separately  to  the  data  in  the  lower  half  and  in  the  upper  half  of 
the  range  of  n.  The  results,  shown  in  Tables  13,  14,  15,  yield  the  same  rankings 


Table  13.  The  Smlth-Srlnivaaan- Thompson  algorithm,  with 
splitting  of  the  range  of  n 


r  ■ 

1 

j 

120  <  n  <  180 

— 

1 

j 

Polynomial 

0.14  X  10“^  X 

0.5183 

0.400 

Superpolynomial 

0.43  X  10“^  X 

0.5155 

0.406 

Exponential 

4.48  X 

0.5041 

0.432  1 

Table  14. 

The  Carpaneto-Toth  algorithm 
splitting  of  the  range 

,  with 
of  n 

Standard 

Type  of 

error  of 

Coefficient  of 

function 

Best  fit 

estimation 

determination 

40  <  n  <  120 

Polynomial 

0.45  X  10‘^  X 

0.116 

0.990 

Super polynomial 

0.12  X  10-^  X 

0.120 

0.989 

Exponential 

0.33  X  ,0-<>36*« 

0.237 

0.959 

140  <  n  <  240 

Polynomial 

0.22  X  lO"^  X 

• 

0.141 

0.698 

Superpolynomial 

0.9  X 

0.138 

0.708 

Exponential 

9.0  X 

0.128 

0.749 
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Table  15.  The  Balas-Christofldes  algorithm,  with 
splitting  of  the  range  of  n 


* 

Type  of 
function 

Best  fit 

Standard 
error  of 
estimation 

Coefficient  of 
determination 

■ 

1 

50  <  n  <  175 

i 

Polynomial 

0.14  X  lO"^  X 

0.288 

1 

i 

0.937  1 

1 

Superpolynomlal 

0.74  X  lo"^  X  n0-2671ogn 

0.242 

! 

0.956 

Exponential 

0.54  X  10-'  X  n0-0245n 

0.120 

0.989 

200  <  n  <  325 

Polynomial 

0.5  X  lO"'  X  a 

0.100 

0.984 

Superpolynomlal 

O.U  X  10-3  X  aO-SSSlog" 

0.095 

0.986 

Exponential 

0.199  X 

0.087 

0.988 

as  before  for  the  lower  half  of  the  range  of  n,  but  almost  completely  reverse 
the  rankings  for  the  upper  half  of  the  range:  Ignoring  differences  of  less  than 
0.01  In  the  coefficient  of  determination,  the  exponential  function  ranks  first 
over  this  range  for  both  the  SST  and  CT  algorithms,  with  the  polynomial  and 
superp'^lynomlal  functions  tied  for  second  place;  whereas  for  the  BC  algorithm, 
all  three  functions  are  now  tied.  To  the  cautionary  note  voiced  earlier,  we 
should  now  add  the  fact  that  the  coefficient  of  determination  for  this  range  of 
n  (l.e.,  the  upper  half)  Is  considerably  weaker  for  SST  (0.40-0.43)  and  CT  (0.70-0.75) 
than  for  the  full  range  of  n,  while  for  BC  It  Is  about  the  same,  l.e.,  rather 
strong  (0.98-0.99).  The  findings  listed  above  are  supported  by  additional  statis¬ 
tical  evidence,  for  which,  as  well  as  for  the  methodological  details  of  the 
analysis,  the  reader  Is  referred  to  Bales,  McGuire  and  Toth  [1983]. 

The  conclusions  that  we  draw  from  this  statistical  analysis  are  as  follows. 


First,  over  the  limited  range  of  n  for  which  data  are  available,  the  per¬ 
formance  of  the  three  algorithms  analyzed  can  be  described  almost  equally  well 


by  each  of  the  three  function  types  considered:  polynomial,  superpolynomlal  and 
exponential.  Second,  while'  the  best  fitting  polynomial  functions  are  of  a  moderate 
degree  (ranging  between  1.4  and  4.4),  the  best  fitting  exponential  functions  have 
a  base  very  close  to  1  (ranging  between  e*^**^^^  ■  1.079  and  a  1.012).  Note 

that  an  exponential  function  of  this  type  Is  very  different  from  the  function  e^. 
While  the  value  of  the  latter  Increases  more  than  twice  whenever  the  variable  goes 
from  n  to  n  +  1,  the  value  of  1.012^^  Increases  only  by  1.2  percent  when  the  variable 
goes  from  n  to  n  +  1. 
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EXERCISES 

1.  Show  thsc,  If  the  relaxetlon  R  of  TSP  used  in  the  branch  and  bound 
procedure  of  section  1  (either  version)  has  a  finite  solution  set,  and  the 
branching  rule  Is  such  that  at  every  node  1  at  least  one  solution  to  the  re¬ 
laxed  problem  R.  becomes  Infeasible  for  all  the  successor  problems  R, R.  , 

11  Iq 

then  the  procedure  Is  finite.  For  the  rooted  tree  representation  of  the  branch 
and  bound  procedure  discussed  In  section  1,  what  Is  the  maximum  depth  of  the 
tree,  l.e.,  the  maximum  length  of  a  path  Joining  any  node  to  the  root?  Give 
a  bound  on  the  number  of  nodes  of  the  rooted  tree. 

2.  Let  X*  ^^J^  optimal  solution  to  the  assignment  problem  AP, 

and  let  AP^  be  the  assignment  problem  obtained  from  AP  by  adding  the  constraint 

X.  .  =0  for  some  (1  ,  j  )  such  that  x*  ,  »  1.  Describe  a  version  of  the 

.1  0  0  11 
0*^0  0^0 

Hungarian  method  that  starts  with  x*  and  finds  an  optimal  solution  to  AF^  In 

2 

0(n  )  steps.  (Hint:  show  that  only  one  labeling  Is  required.) 

3.  Show  that  the  "breadth  first"  rule  of  always  choosing  the  node  with 
the  best  lower  bound  produces  a  search  tree  with  a  minimum  number  of  nodes. 

If  (1)  every  node  selection  Is  uniquely  determined,  l.e.,  there  are  no  ties 
for  the  best  lower  bound;  and  (11)  the  branching  and  bounding  at  any  given 
node  Is  done  only  on  the  basis  of  Information  generated  on  the  path  from  the 
root  of  the  tree  to  the  given  node.  Constinict  examples  to  show  that  neither 
(1)  nor  (11)  Is  sufficient  by  Itself  (Fox,  Lenstra,  Rlnnooy  Kan  and  Schrage 
[1978]).  Assuming  that  conditions  (1),  (11)  are  not  satisfied,  describe  a 
subproblem  selection  rule  that  combines  some  of  the  advantages  of  "breadth 
first"  with  some  of  those  of  "depth  first"  (Forrest,  Hirst  and  Tomlin  [1974], 
Balas  [1975]). 
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4.  A  subgradient  of  a  convex  function  f(x)  at  x  ■  x  Is  a  vector  s 
such  that 


f(x)  -  f(^)  >  sCx-x'^)  ,  V  X, 

and  the  subdlfferentlal  of  f(x)  at  x  Is  the  set  of  all  subgradients  of 
f(x)  at  X  *  x'^. 

Let  J  be  the  family  of  1-trees  In  G  »  (V,E)  Introduced  In  section  3, 

and  let  X(J)  denote  the  set  of  Incidence  vectors  of  1-trees.  Show  that,  If 

Ic  Ic 

H(X  )  Is  a  1-tree  whose  Incidence  vector  x  minimizes  the  function 


E  E  (c 
l€v  j  >1 


Ij 


+  Xj  + 


Vc  k 

on  ,  and  d^^  Is  the  degree  of  node  1  In  H(X  ) ,  then  the  n-vector  whose 
components  are  d^  -  2,  l€v,  Is  a  subgradient  of  L(X)  at  •  Identify  the 

If 

subdlfferentlal  of  L(X)  at  X  . 

5.  Let  admissible  graph  defined  In  section  4  with 

respect  to  (u,  v,  w),  and  let  x  be  the  Incidence  vector  of  a  tour  H(x)  In  G^. 

Show  that  H(x)  Is  an  optimal  tour  In  G  If  x  satisfies  Inequality  (37)  with 
equality  for  all  t€T  such  that  w^  >  0.  Is  this  sufficient  condition  also 
necessary?  (Hint:  use  the  optimality  conditions  for  the  linear  program  de¬ 
fined  by  (1),  (2),  (37)  and  x^j  >  0,  ¥  l,j,  and  Its  dual.) 

6.  Show  that  bounding  procedure  1  of  section  4  generates  at  most 
(h-l)(h+2)/2  cutsets,  where  h  Is  the  number  of  subtours  In  the  optimal  solu¬ 
tion  X  to  AP.  (Hint:  use  the  following  facts:  (1)  any  node  of  a  strongly 
connected  component,  hence  of  a  subtour.  Is  reachable  from  any  other  node; 

(11)  every  directed  cutset  that  Is  generated  adds  to  A^  at  least  one  new 
arc  joining  some  subtour  to  some  other  subtour;  and  (111)  when  two  subtours 

are  joined  by  arcs  In  both  directions,  they  form  a  strongly  connected  component.) 
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7.  Show  that,  If  la  tha  lowar  bound  on  v(TSP)  obtained  by  bounding 
procedure  1,  the  lower  bound  generated  by  bounding  procedure  2  Is 


B2  -  +  E 


t6T„ 


(Hint:  use  the  fact  that  If  the  cost  of  each  arc  In  the  set  U  (S  ,  S  ) 

t€T2  ^  ^ 

Is  Increased  by  ^,2*  then  the  value  of  the  solution  x  (hence  of  the  solution 

(u,v)  to  the  dual  of  AP  obtained  at  the  end  of  procedure  2)  Is  v(AP)  +  E  js  ||j.  . ) 

t€T2 

8.  Let  k  be  an  articulation  point  of  the  admissible  graph  G^,  let  be 

the  node  set  of  one  of  the  components  of  and  consider  the  two  directed 

cutsets 

-  {S^,  V\Sj.U{k}},  k"-  {V\S^U{k},  S^]. 

Show  that  the  Inequality 

i  m  X  >  1 

(l,j)€K^UK^^  J  ” 

Is  the  sum  of  the  Inequality  (5)  for  S  -  S^U  {k],  the  Inequality  (3)  for  S  «  S^, 
and  the  equations  (2)  for  all  l€S^  and  j^S^.. 

9.  Formulate  the  n-path  relaxation  of  the  TSP  discussed  In  section  5  for 
both  the  asymmetric  and  the  symmetric  cases,  with  a  Lagrangean  function  Involving 
the  equations  (2)  (In  the  asymmetric  case)  or  (7)  (In  the  symmetric  case).  Give 
some  examples  of  side  constraints,  l.e.,  extra  conditions,  that  this  relaxation 
of  TSP  can  accommodate  but  the  1-arborescence  or  1-tree  relaxations  can  not. 
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This  paper  reviews  the  state  of  the  art  In  enumeratlve  solution  methods  for  the) 
traveling  salesman  problem  (TSP).  The  Introduction  (Section  1)  discusses  the  j 
main  Ingredients  of  branch  and  bound  methods  for  the  TSP.  Sections  2,  3,  and  j 
4  discuss  classes  of  methods  based  on  three  different  relaxations  of  the  TSP:  * 
the  assignment  problem  with  the  TSP  cost  function,  the  1-tree  problem  with  a  | 
Lagrangean  objective  function,  and  the  assignment  problem  with  a  Lagrangean  j 
objective  function.  Section  5  briefly  reviews  some  other  relaxations  of  the  1 
TSP,  while  Section  6  discusses  the  perfeHoenee  af  some  st»te-ef  the  art  j 

1.JT1  ■niTin.i  A*  I  unu  A«  i«  anALCTM  (cont  d) 

I  JAM  71 


toinoN  Of  I  NOV  M  IS  essoLCTt 

i/H  OlOl'OU'SSOI  > 


SSeueiTV  CI.ASWI'TcaTION  or  l^is  RAOe  (fttan  aaia  tmarmt) 


computer  codes.  Besides  material  from 
the  results  and  statistical  analysis  of 
for  the  purposes  of  this  review. 


the  literature,  the  paper  also  Includes 
some  computational  experiments  designed 


